NOVEL MAMMALIAN G-PROTEIN COUPLED RECEPTORS HAVING
EXTRACELLULAR LEUCINE RICH REPEAT REGIONS 

Introduction 

Field of the Invention

The field of this invention is the G-protein coupled receptor family of proteins. 
Background 

Gonadotropins (luteinizing hormone, LH; follicle stimulating hormone, FSH;
chorionic gonadotropin, CG) and thyrotropin (TSH) are essential for the growth and
differentiation of the gonads and the thyroid gland, respectively. These glycoprotein hormones
bind specific target cell receptors on the plasma membrane to activate the cAMP-protein
kinase A pathway.

The receptors for LH, FSH and TSH belong to the large G-protein-coupled, seven
trans-membrane protein family but are unique in having a large N-terminal extra-cellular
(ecto-) domain containing leucine-rich repeats important for interaction with large
glycoprotein ligands. Studies suggest that in these receptors, the extra-cellular leucine rich
repeat region serves as a "baseball glove" which efficiently catches its corresponding large
hormone ligand and optimally orients it for interaction with the seven trans-membrane
helical domain of the receptor.

Because hormones and receptors play a prominent role in a variety of
physiological processes, there is continued interest in the identification of novel receptors
and their ligands, as well as the genes encoding the same.
Relevant Literature 

References of interest include: el Tayar, N., "Advances in the Molecular
Understanding of Gonadotropins-Receptors Interactions," Mol. Cell. Endocrinol.
(December 20, 1996) 125: 65-70; Bhowmick et al., "Determination of Residues Important
in Hormone Binding to the Extracellular Domain of the Luteinizing Hormone/Chorionic
Gonadotropin Receptor by Site-Directed Mutagenesis and Modeling," Mol. Endocrinol.
(September 1996) 10: 1147-1159; Thomas et al., "Mutational Analyses of the
Extracellular Domain of the Full-Length Lutropin/Choriogonadotropin Receptor Suggest
Leucine-Rich Repeats 1-6 are Involved in Hormone Binding," Mol. Endocrinol. (June
1996) 10: 760-768; Segaloff & Ascoli, "The Gonadotropin Receptors: Insights from the
Cloning of their cDNAs," Oxf. Rev. Reprod. Biol. (1992) 14: 141-168; Braun et al.,
"Amino-Terminal Leucine-Rich Repeats in Gonadotropin Receptors Determine Hormone
Selectivity," EMBO J. (July 1991) 10: 1885-1890; and Segaloff et al., "Structure of the
Lutropin/Choriogonadotropin Receptor," Recent Prog. Horm. Res. (1990) 46: 261-301.

Summary of the Invention 
Three novel mammalian G-protein coupled receptors having extra-cellular leucine
rich repeat domains, i.e. LGR4, LGR5 and LGR7, and polypeptide compositions related
thereto, as well as nucleotide compositions encoding the same, are provided. The subject
proteins, polypeptide and nucleic acid compositions find use in a variety of different
applications, including the identification of homologous or related genes; the production
of compositions that modulate the expression or function of the subject proteins; the
identification of endogenous ligands for the subject orphan receptors; the generation of
functional binding proteins for the neutralization of the actions of endogenous ligands;
gene therapy; mapping functional regions of the protein; and studying associated
physiological pathways. In addition, modulation of the gene activity in vivo is used for
prophylactic and therapeutic purposes, and the like.

Brief Description of the Figures 
Fig. 1 provides the nucleotide and amino acid sequence for human LGR4.

Fig. 2 provides the nucleotide and amino acid sequence for human LGR5.

Fig. 3 provides the nucleotide and amino acid sequence for human LGR7, long form.

Fig. 4 provides the nucleotide and amino acid sequence for human LGR7, short form.

Fig. 5 provides an alignment comparison of the long and short forms of LGR7.

Fig. 6 provides a comparison of the deduced amino acid sequences of the LGR4 and LGR5
cDNAs and those encoding FSH and LH receptors.

Description of the Specific Embodiments
Novel mammalian G-protein coupled receptors having extra-cellular leucine rich
repeat regions (i.e. LGR4, LGR5 and LGR7) and polypeptide compositions related
thereto, as well as nucleic acid compositions encoding the same, are provided. The
subject polypeptide and/or nucleic acid compositions find use in a variety of different
applications, including the identification of homologous or related genes; the
identification of endogenous ligands for these novel receptors; the production of
compositions that modulate the expression or function of the receptors; gene therapy;
mapping functional regions of the receptors; studying associated physiological
pathways; in vivo prophylactic and therapeutic purposes; use as immunogens for producing
antibodies; screening for biologically active agents; and the like.

Before the subject invention is further described, it is to be understood that the
invention is not limited to the particular embodiments of the invention described below, as
variations of the particular embodiments may be made and still fall within the scope of the
appended claims. It is also to be understood that the terminology employed is for the
purpose of describing particular embodiments, and is not intended to be limiting. Instead,
the scope of the present invention will be established by the appended claims.

In this specification and the appended claims, the singular forms "a," "an," and
"the" include plural reference unless the context clearly dictates otherwise. Unless defined
otherwise, all technical and scientific terms used herein have the same meaning as
commonly understood by one of ordinary skill in the art to which this invention belongs.

Characterization of LGR4, LGR5 and LGR7

LGR4, LGR5 and LGR7 are novel mammalian receptors of the G-protein coupled,
seven trans-membrane family of proteins, specifically the subfamily of G-protein coupled
seven trans-membrane proteins which are characterized by the presence of extra-cellular
leucine rich repeat regions. As such, these proteins have trans-membrane segments and
extra-cellular regions similar to those found in the known LH, FSH, and TSH receptors. In
other words, these proteins have both a G-protein coupled seven trans-membrane region
and a leucine rich repeat extra-cellular domain. The N-terminal extra-cellular domains of
these proteins also show high homology with Drosophila Slit and Toll proteins having
leucine rich repeats. These proteins are expressed in diverse tissues.

The human LGR4 gene has a nucleotide sequence as shown in SEQ ID NO:01.
The human LGR4 gene product has an amino acid sequence as shown in SEQ ID NO:02.
LGR4 is expressed in a plurality of different tissue types, including ovary, testis, adrenal,
placenta, liver, kidney and intestine.

The human LGR5 gene has a nucleotide sequence as shown in SEQ ID NO:03.
The LGR5 gene product has an amino acid sequence as shown in SEQ ID NO:04. LGR5
has been found to be mainly expressed in muscle, placenta and spinal cord tissue.

The human LGR7 gene encodes multiple splicing variants, each of which contains
a multitude of cysteine-rich low density lipoprotein (LDL) binding motifs at the
N-terminus in addition to the leucine rich repeat region. The longer forms of LGR7 have a
higher similarity than the shorter forms of LGR7 to snail LGR in the trans-membrane domain
and the N-terminal LDL binding domain. The overall structure of both the long and short
forms of LGR7 is similar to that of the LH receptor. The human LGR7 short form gene
has a nucleotide sequence as shown in SEQ ID NO:05. The LGR7 short form gene
product has an amino acid sequence as shown in SEQ ID NO:06. The human LGR7 long
form gene has a nucleotide sequence as shown in SEQ ID NO:07. The LGR7 long form
gene product has an amino acid sequence as shown in SEQ ID NO:08. LGR7 is expressed
in multiple tissues, including testis, ovary, prostate, intestine and colon.

Identification of LGR4, LGR5 and LGR7 Sequences
Homologs of LGR4, LGR5 and LGR7 are identified by any of a number of
methods. A fragment of the provided cDNA may be used as a hybridization probe against
a cDNA library from the target organism of interest, where low stringency conditions are
used. The probe may be a large fragment, or one or more short degenerate primers.

Nucleic acids having sequence similarity are detected by hybridization under low
stringency conditions, for example, at 50°C and 6xSSC (0.9 M sodium chloride/0.09 M
sodium citrate) and remain bound when subjected to washing at 55°C in 1xSSC (0.15 M
sodium chloride/0.015 M sodium citrate). Sequence identity may be determined by
hybridization under stringent conditions, for example, at 50°C or higher and 0.1xSSC (15
mM sodium chloride/1.5 mM sodium citrate). Nucleic acids having a region of
substantial identity to the provided LGR4, LGR5 and/or LGR7 sequences, e.g. allelic
variants, genetically altered versions of the gene, etc., bind to the provided sequences
under stringent hybridization conditions. By using probes, particularly labeled probes of
DNA sequences, one can isolate homologous or related genes. The source of homologous
genes may be any species, e.g., primate species, particularly human; rodents, such as rats
and mice; canines; felines; bovines; ovines; equines; yeast; nematodes; etc.
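
The salt concentrations quoted for each SSC strength scale linearly from the 1xSSC values given above (0.15 M sodium chloride, 0.015 M sodium citrate). The following is a minimal sketch of that arithmetic; the function and constant names are illustrative assumptions and are not part of the specification.

```python
# Illustrative helper (names are hypothetical): salt concentrations for n x SSC.
# Assumes 1x SSC = 0.15 M sodium chloride / 0.015 M sodium citrate, as quoted in the text.

NACL_M_PER_X = 0.15      # mol/L sodium chloride in 1x SSC
CITRATE_M_PER_X = 0.015  # mol/L sodium citrate in 1x SSC


def ssc_molarity(strength):
    """Return (NaCl molarity, sodium citrate molarity) for `strength` x SSC."""
    if strength <= 0:
        raise ValueError("strength must be positive")
    # Round to suppress floating-point noise in the scaled values.
    return (round(strength * NACL_M_PER_X, 6),
            round(strength * CITRATE_M_PER_X, 6))


if __name__ == "__main__":
    for strength in (6, 1, 0.1):
        nacl, citrate = ssc_molarity(strength)
        print(f"{strength}x SSC: {nacl} M NaCl / {citrate} M sodium citrate")
```

Lower SSC strength means lower ionic strength, which is why the 0.1xSSC condition is the more stringent of the three.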

Between mammalian species, e.g., human and mouse, homologs have substantial
sequence similarity, e.g. at least 75% sequence identity, usually at least 90%, more usually
at least 95% between nucleotide sequences. Sequence similarity is calculated based on a
reference sequence, which may be a subset of a larger sequence, such as a conserved
motif, coding region, flanking region, etc. A reference sequence will usually be at least
about 18 nt long, more usually at least about 30 nt long, and may extend to the complete
sequence that is being compared. Algorithms for sequence analysis are known in the art,
such as BLAST, described in Altschul et al. (1990), J. Mol. Biol. 215:403-10. Unless
specified otherwise, all sequence analysis numbers provided herein are as determined with
the BLAST program using default settings. The sequences provided herein are essential
for recognizing LGR4, LGR5 and LGR7-related and homologous proteins in database
searches.
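
To make the identity thresholds concrete, the sketch below computes percent identity over a pair of already-aligned sequences and checks it against the 75% and 18 nt figures mentioned above. It is an illustration only: it does not reimplement BLAST, which finds and scores local alignments itself, and the function names and gap convention are assumptions.

```python
# Illustrative sketch (not BLAST): percent identity over two pre-aligned
# sequences of equal length, with '-' marking alignment gaps.

def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a, aligned_b, threshold=75.0, min_len=18, gap="-"):
    """True if at least `min_len` positions are compared and identity >= threshold."""
    compared = sum(1 for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap)
    if compared < min_len:
        return False
    return percent_identity(aligned_a, aligned_b, gap) >= threshold
```

The `min_len` parameter mirrors the statement that a reference sequence will usually be at least about 18 nt long; shorter comparisons are rejected regardless of identity.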

LGR4, LGR5 and LGR7 Nucleic Acid Compositions
Nucleic acids encoding LGR4, LGR5 and LGR7 may be cDNA or genomic DNA
or a fragment thereof. The terms "LGR4 gene," "LGR5 gene" and "LGR7 gene" shall be
intended to mean the open reading frame encoding specific LGR4, LGR5 and LGR7
polypeptides, and LGR4, LGR5 and LGR7 introns, as well as adjacent 5' and 3' non-coding
nucleotide sequences involved in the regulation of expression, up to about 20 kb beyond
the coding region, but possibly further in either direction. The gene may be introduced
into an appropriate vector for extra-chromosomal maintenance or for integration into a
host genome.



The term "cDNA" as used herein is intended to include all nucleic acids that share
the arrangement of sequence elements found in native mature mRNA species, where
sequence elements are exons and 3' and 5' non-coding regions. Normally mRNA species
have contiguous exons, with the intervening introns, when present, removed by nuclear
RNA splicing, to create a continuous open reading frame encoding an LGR4, LGR5 or
LGR7 protein.

A genomic sequence of interest comprises the nucleic acid present between the
initiation codon and the stop codon, as defined in the listed sequences, including all of the
introns that are normally present in a native chromosome. It may further include the 3'
and 5' untranslated regions found in the mature mRNA. It may further include specific
transcriptional and translational regulatory sequences, such as promoters, enhancers, etc.,
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5' or 3'
end of the transcribed region. The genomic DNA may be isolated as a fragment of 100
kbp or smaller, and substantially free of flanking chromosomal sequence. The genomic
DNA flanking the coding region, either 3' or 5', or internal regulatory sequences as
sometimes found in introns, contains sequences required for proper tissue and stage
specific expression.

The sequence of the 5' flanking region may be utilized for promoter elements,
including enhancer binding sites, that provide for developmental regulation in tissues
where LGR4, LGR5 and/or LGR7 is expressed. The tissue specific expression is useful for
determining the pattern of expression, and for providing promoters that mimic the native
pattern of expression. Naturally occurring polymorphisms in the promoter region are
useful for determining natural variations in expression, particularly those that may be
associated with disease.

Alternatively, mutations may be introduced into the promoter region to determine
the effect of altering expression in experimentally defined systems. Methods for the
identification of specific DNA motifs involved in the binding of transcriptional factors are
known in the art, e.g. sequence similarity to known binding motifs, gel retardation studies,
etc. For examples, see Blackwell et al. (1995), Mol. Med. 1:194-205; Mortlock et al.
(1996), Genome Res. 6:327-33; and Joulin and Richard-Foy (1995), Eur. J. Biochem.
232:620-626.



The regulatory sequences may be used to identify cis acting sequences required for
transcriptional or translational regulation of LGR4, LGR5 and/or LGR7 expression,
especially in different tissues or stages of development, and to identify cis acting
sequences and trans-acting factors that regulate or mediate LGR4, LGR5 and/or LGR7
expression. Such transcription or translational control regions may be operably linked to
an LGR4, LGR5 or LGR7 gene in order to promote expression of wild type or altered
LGR4, LGR5 or LGR7 or other proteins of interest in cultured cells, or in embryonic, fetal
or adult tissues, and for gene therapy.

The nucleic acid compositions of the subject invention may encode all or a part of
the subject polypeptides. Double or single stranded fragments of the DNA sequence may be
obtained by chemically synthesizing oligonucleotides in accordance with
conventional methods, by restriction enzyme digestion, by PCR amplification, etc. For
the most part, DNA fragments will be of at least 15 nt, usually at least 18 nt or 25 nt, and
may be at least about 50 nt. Such small DNA fragments are useful as primers for PCR,
hybridization screening probes, etc. Larger DNA fragments, i.e. greater than 100 nt, are
useful for production of the encoded polypeptide. For use in amplification reactions, such
as PCR, a pair of primers will be used. The exact composition of the primer sequences is
not critical to the invention, but for most applications the primers will hybridize to the
subject sequence under stringent conditions, as known in the art. It is preferable to choose
a pair of primers that will generate an amplification product of at least about 50 nt,
preferably at least about 100 nt. Algorithms for the selection of primer sequences are
generally known, and are available in commercial software packages. Amplification
primers hybridize to complementary strands of DNA, and will prime towards each other.
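
The geometry of a primer pair (opposite strands, priming towards each other) and the resulting product size can be sketched as follows. This is a simplified illustration that assumes exact-match annealing on a linear template. The function names and any sequences passed to it are hypothetical and are not taken from the sequence listing.

```python
# Illustrative sketch: expected PCR product length for an exact-match primer pair.
# `template` is the top strand written 5'->3'. The forward primer matches the top
# strand; the reverse primer matches the bottom strand, so its reverse complement
# is what appears in the top-strand sequence.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward, reverse):
    """Length of the product spanning both primers, or None if either fails to anneal."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    # The reverse primer must anneal downstream of the forward primer.
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return end + len(reverse_site) - start
```

A pair can then be screened against the preference stated above, for example by checking that `amplicon_length(...)` is not `None` and is at least 50 or 100. Real primer design software also accounts for melting temperature, mismatches and secondary structure, which this sketch ignores.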

The LGR4, LGR5 and LGR7 genes are isolated and obtained in substantial purity,
generally as other than an intact chromosome. Usually, the DNA will be obtained
substantially free of other nucleic acid sequences that do not include an LGR4, LGR5 or
LGR7 sequence or fragment thereof, generally being at least about 50%, usually at least
about 90% pure, and are typically "recombinant", i.e. flanked by one or more nucleotides
with which it is not normally associated on a naturally occurring chromosome.

The DNA may also be used to identify expression of the gene in a biological
specimen. The manner in which one probes cells for the presence of particular nucleotide
sequences, as genomic DNA or RNA, is well established in the literature and does not
require elaboration here. DNA or mRNA is isolated from a cell sample. The mRNA may
be amplified by RT-PCR, using reverse transcriptase to form a complementary DNA
strand, followed by polymerase chain reaction amplification using primers specific for the
subject DNA sequences. Alternatively, the mRNA sample is separated by gel
electrophoresis, transferred to a suitable support, e.g. nitrocellulose, nylon, etc., and then
probed with a fragment of the subject DNA as a probe. Other techniques, such as
oligonucleotide ligation assays, in situ hybridizations, and hybridization to DNA probes
arrayed on a solid chip may also find use. Detection of mRNA hybridizing to the subject
sequence is indicative of LGR4, LGR5 and/or LGR7 gene expression in the sample.

The sequence of an LGR4, LGR5 or LGR7 gene, including flanking promoter
regions and coding regions, may be mutated in various ways known in the art to generate
targeted changes in promoter strength, sequence of the encoded protein, etc. The DNA
sequence or protein product of such a mutation will usually be substantially similar to the
sequences provided herein, i.e. will differ by at least one nucleotide or amino acid,
respectively, and may differ by at least two but not more than about ten nucleotides or
amino acids. The sequence changes may be substitutions, insertions, deletions, or a
combination thereof. Deletions may further include larger changes, such as deletions of a
domain or exon. Other modifications of interest include epitope tagging, e.g. with the
FLAG system, HA, etc. For studies of subcellular localization, fusion proteins with green
fluorescent proteins (GFP) may be used.

Techniques for in vitro mutagenesis of cloned genes are known. Examples of
protocols for site specific mutagenesis may be found in Gustin et al. (1993),
Biotechniques 14:22; Barany (1985), Gene 37:111-23; Colicelli et al. (1985), Mol. Gen.
Genet. 199:537-9; and Prentki et al. (1984), Gene 29:303-13. Methods for site specific
mutagenesis can be found in Sambrook et al., Molecular Cloning: A Laboratory Manual,
CSH Press 1989, pp. 15.3-15.108; Weiner et al. (1993), Gene 126:35-41; Sayers et al.
(1992), Biotechniques 13:592-6; Jones and Winistorfer (1992), Biotechniques 12:528-30;
Barton et al. (1990), Nucleic Acids Res. 18:7349-55; Marotti and Tomich (1989), Gene
Anal. Tech. 6:67-70; and Zhu (1989), Anal. Biochem. 177:120-4. Such mutated genes may
be used to study structure-function relationships of LGR4, LGR5 and/or LGR7, or to alter
properties of the protein that affect its function or regulation.

LGR4, LGR5 and LGR7 Polypeptides 
Also provided by the subject invention are LGR4, LGR5 and LGR7 polypeptide
compositions. The term polypeptide composition as used herein refers to both the full
length proteins as well as portions or fragments thereof. Also included in this term are
variations of the naturally occurring proteins, where such variations are homologous or
substantially similar to the naturally occurring protein, be the naturally occurring protein
the human protein, mouse protein, or protein from some other species which naturally
expresses an LGR4, LGR5 or LGR7 protein, usually a mammalian species. A candidate
homologous protein is substantially similar to an LGR4, LGR5 or LGR7 protein of the
subject invention, and therefore is an LGR4, LGR5 or LGR7 protein of the subject
invention, if the candidate protein has a sequence that has at least about 80%, usually at
least about 90% and more usually at least about 98% sequence identity with an LGR4,
LGR5 or LGR7 protein, as measured by BLAST, supra. In the following description of
the subject invention, the term "LGR4, LGR5 or LGR7 protein" is used to refer not only
to the human LGR4, LGR5 or LGR7 protein, but also to homologs thereof expressed in
non-human species, e.g. murine, rat and other mammalian species.

The subject gene may be employed for producing all or portions of LGR4, LGR5
and LGR7 polypeptides. By "LGR4 polypeptide/protein," "LGR5 polypeptide/protein,"
and "LGR7 polypeptide/protein" is meant an amino acid sequence encoded by an open
reading frame (ORF) of LGR4, LGR5 and LGR7 genes, including the full-length native
polypeptide and fragments thereof, particularly biologically active fragments and/or
fragments corresponding to functional domains, e.g. extra-cellular regions; and including
fusions of the subject polypeptides to other proteins or parts thereof, e.g. chimeric
proteins. For expression, an expression cassette may be employed. The expression vector
will provide a transcriptional and translational initiation region, which may be inducible or
constitutive, where the coding region is operably linked under the transcriptional control
of the transcriptional initiation region, and a transcriptional and translational termination
region. These control regions may be native to an LGR4, LGR5 or LGR7 gene, or may be
derived from exogenous sources.

Expression vectors generally have convenient restriction sites located near the
promoter sequence to provide for the insertion of nucleic acid sequences encoding
heterologous proteins. A selectable marker operative in the expression host may be
present. Expression vectors may be used for the production of fusion proteins, where the
exogenous fusion peptide provides additional functionality, i.e. increased protein
synthesis, stability, reactivity with defined antisera, an enzyme marker, e.g.
β-galactosidase, etc.

Expression cassettes may be prepared comprising a transcription initiation region,
the gene or fragment thereof, and a transcriptional termination region. Of particular
interest is the use of sequences that allow for the expression of functional epitopes or
domains, usually at least about 8 amino acids in length, more usually at least about 15
amino acids in length, to about 25 amino acids, and up to the complete open reading frame
of the gene. After introduction of the DNA, the cells containing the construct may be
selected by means of a selectable marker, the cells expanded and then used for expression.

LGR4, LGR5 or LGR7 polypeptides may be expressed in prokaryotes or
eukaryotes in accordance with conventional ways, depending upon the purpose for
expression. For large scale production of the protein, a unicellular organism, such as E.
coli, B. subtilis, S. cerevisiae, insect cells in combination with baculovirus vectors, or cells
of a higher organism such as vertebrates, particularly mammals, e.g. COS 7 cells, may be
used as the expression host cells. In some situations, it is desirable to express the LGR4,
LGR5 or LGR7 gene in eukaryotic cells, where the LGR4, LGR5 or LGR7 protein will
benefit from native folding and post-translational modifications. Small peptides can also
be synthesized in the laboratory. Polypeptides that are subsets of the complete LGR4,
LGR5 or LGR7 sequence may be used to identify and investigate parts of the protein
important for function or to raise antibodies directed against these regions.

For production of the extracellular domain of the LGR4, LGR5 or LGR7 receptor,
the anchored receptor approach as described in Osuga et al., Mol. Endocrinol. (1997) 11:
1659-1668 may be employed. Likewise, the chimeric receptor approach described in Kudo
et al., J. Biol. Chem. (1996) 271: 22470-22478 may be used.

Such peptides find use in the identification of endogenous ligands and in drug
screening for agonists and antagonists using methods described in Osuga, supra.
Solubilized extracellular domains find use as therapeutic agents, e.g. in the neutralization
of the action of endogenous ligands.

With the availability of the protein or fragments thereof in large amounts, by
employing an expression host, the protein may be isolated and purified in accordance with
conventional ways. A lysate may be prepared of the expression host and the lysate
purified using HPLC, exclusion chromatography, gel electrophoresis, affinity
chromatography, or other purification technique. The purified protein will generally be at
least about 80% pure, preferably at least about 90% pure, and may be up to and including
100% pure. Pure is intended to mean free of other proteins, as well as cellular debris.

The expressed LGR4, LGR5 and LGR7 polypeptides are useful for the production
of antibodies, where short fragments provide for antibodies specific for the particular
polypeptide, and larger fragments or the entire protein allow for the production of
antibodies over the surface of the polypeptide. Antibodies may be raised to the wild-type
or variant forms of LGR4, LGR5 or LGR7. Antibodies may be raised to isolated peptides
corresponding to these domains, or to the native protein.

Antibodies are prepared in accordance with conventional ways, where the
expressed polypeptide or protein is used as an immunogen, by itself or conjugated to
known immunogenic carriers, e.g. KLH, pre-S HBsAg, other viral or eukaryotic proteins,
or the like. Various adjuvants may be employed, with a series of injections, as
appropriate. Both polyclonal and monoclonal antibodies may be produced. For
monoclonal antibodies, after one or more booster injections, the spleen is isolated, the
lymphocytes immortalized by cell fusion, and then screened for high affinity antibody
binding. The immortalized cells, i.e. hybridomas, producing the desired antibodies may
then be expanded. For further description, see Monoclonal Antibodies: A Laboratory
Manual, Harlow and Lane eds., Cold Spring Harbor Laboratories, Cold Spring Harbor,
New York, 1988. If desired, the mRNA encoding the heavy and light chains may be
isolated and mutagenized by cloning in E. coli, and the heavy and light chains mixed to
further enhance the affinity of the antibody. Alternatives to in vivo immunization as a
method of raising antibodies include binding to phage "display" libraries, usually in
conjunction with in vitro affinity maturation.

Diagnostic Uses 

The subject nucleic acid and/or polypeptide compositions may be used to analyze a
patient sample for the presence of polymorphisms associated with a disease state or
genetic predisposition to a disease state. Biochemical studies may be performed to
determine whether a sequence polymorphism in an LGR4, LGR5 or LGR7 coding region or
control regions is associated with disease. Disease associated polymorphisms may include
deletion or truncation of the gene, mutations that alter expression level, that affect the
activity of the protein, and the like.

Changes in the promoter or enhancer sequence that may affect expression levels of
LGR4, LGR5 or LGR7 can be compared to expression levels of the normal allele by
various methods known in the art. Methods for determining promoter or enhancer
strength include quantitation of the expressed natural protein; insertion of the variant
control element into a vector with a reporter gene such as β-galactosidase, luciferase,
chloramphenicol acetyltransferase, etc. that provides for convenient quantitation; and the
like.

A number of methods are available for analyzing nucleic acids for the presence of
a specific sequence, e.g. a disease associated polymorphism. Where large amounts of
DNA are available, genomic DNA is used directly. Alternatively, the region of interest is
cloned into a suitable vector and grown in sufficient quantity for analysis. Cells that
express LGR4, LGR5 or LGR7 may be used as a source of mRNA, which may be assayed
directly or reverse transcribed into cDNA for analysis. The nucleic acid may be amplified
by conventional techniques, such as the polymerase chain reaction (PCR), to provide
sufficient amounts for analysis. The use of the polymerase chain reaction is described in
Saiki, et al. (1985), Science 239:487, and a review of techniques may be found in
Sambrook, et al., Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp.
14.2-14.33. Alternatively, various methods are known in the art that utilize oligonucleotide
ligation as a means of detecting polymorphisms; for examples see Riley et al. (1990),
Nucl. Acids Res. 18:2887-2890; and Delahunty et al. (1996), Am. J. Hum. Genet.
58:1239-1246.

A detectable label may be included in an amplification reaction. Suitable labels
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red,
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM),
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX),
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); radioactive labels, e.g. 32P, 35S, 3H; etc. The
label may be a two stage system, where the amplified DNA is conjugated to biotin,
haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc.,
where the binding partner is conjugated to a detectable label. The label may be
conjugated to one or both of the primers. Alternatively, the pool of nucleotides used in the
amplification is labeled, so as to incorporate the label into the amplification product.

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or
other methods, and the sequence of bases compared to a wild-type LGR4, LGR5 or LGR7
sequence. Hybridization with the variant sequence may also be used to determine its
presence, by Southern blots, dot blots, etc. The hybridization pattern of a control and
variant sequence to an array of oligonucleotide probes immobilized on a solid support, as
described in US 5,445,934, or in WO 95/35505 (the disclosures of which are herein
incorporated by reference), may also be used as a means of detecting the presence of
variant sequences. Single strand conformational polymorphism (SSCP) analysis,
denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices
are used to detect conformational changes created by DNA sequence variation as
alterations in electrophoretic mobility. Alternatively, where a polymorphism creates or
destroys a recognition site for a restriction endonuclease, the sample is digested with that
endonuclease, and the products size fractionated to determine whether the fragment was
digested. Fractionation is performed by gel or capillary electrophoresis, particularly
acrylamide or agarose gels.
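
The restriction-site approach can be illustrated with a small in silico digest: if a polymorphism destroys a recognition site, the variant yields one uncut fragment where the wild-type yields two. The sketch below assumes a linear fragment and exact site matching. The function name is hypothetical, and the example uses EcoRI (recognition site GAATTC, cut after the first G) only as a familiar enzyme, not one named in the specification.

```python
# Illustrative sketch: fragment sizes expected from digesting a linear DNA
# fragment with a restriction endonuclease, for comparing wild-type and variant.

def digest(seq, site, cut_offset=0):
    """Return fragment lengths after cutting `seq` at every occurrence of `site`.

    `cut_offset` is the number of bases into the site at which the top strand is cut.
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Skip zero-length pieces (a cut exactly at either end of the fragment).
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


if __name__ == "__main__":
    # Made-up sequences for illustration only.
    wild_type = "A" * 20 + "GAATTC" + "C" * 30
    variant = "A" * 20 + "GAGTTC" + "C" * 30   # single base change destroys the site
    print("wild-type fragments:", digest(wild_type, "GAATTC", cut_offset=1))
    print("variant fragments:  ", digest(variant, "GAATTC", cut_offset=1))
```

The differing fragment lists correspond to the band patterns that size fractionation on a gel would distinguish.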

Screening for mutations in LGR4, LGR5 or LGR7 may be based on the functional
or antigenic characteristics of the protein. Protein truncation assays are useful in detecting
deletions that may affect the biological activity of the protein. Various immunoassays
designed to detect polymorphisms in LGR4, LGR5 or LGR7 proteins may be used in
screening. Where many diverse genetic mutations lead to a particular disease phenotype,
functional protein assays have proven to be effective screening tools. The activity of the
encoded LGR4, LGR5 or LGR7 protein may be determined by comparison with the
wild-type protein.

Antibodies specific for LGR4, LGR5 or LGR7 proteins may be used in staining or 
in immunoassays. Samples, as used herein, include biological fluids such as semen,
blood, cerebrospinal fluid, tears, saliva, lymph, dialysis fluid and the like; organ or tissue
culture derived fluids; and fluids extracted from physiological tissues. Also included in
the term are derivatives and fractions of such fluids. The cells may be dissociated, in the 
case of solid tissues, or tissue sections may be analyzed. Alternatively a lysate of the cells 
may be prepared. 

Diagnosis may be performed by a number of methods to determine the absence or
presence or altered amounts of normal or abnormal LGR4, LGR5 or LGR7 in patient
cells. For example, detection may utilize staining of cells or histological sections,
performed in accordance with conventional methods. Cells are permeabilized to stain
cytoplasmic molecules. The antibodies of interest are added to the cell sample, and
incubated for a period of time sufficient to allow binding to the epitope, usually at least
about 10 minutes. The antibody may be labeled with radioisotopes, enzymes, fluorescers,
chemiluminescers, or other labels for direct detection. Alternatively, a second stage
antibody or reagent is used to amplify the signal. Such reagents are well known in the art.
For example, the primary antibody may be conjugated to biotin, with horseradish
peroxidase-conjugated avidin added as a second stage reagent. Alternatively, the
secondary antibody may be conjugated to a fluorescent compound, e.g. fluorescein,
rhodamine, Texas red, etc. Final detection uses a substrate that undergoes a color change
in the presence of the peroxidase. The absence or presence of antibody binding may be
determined by various methods, including flow cytometry of dissociated cells,
microscopy, radiography, scintillation counting, etc.

Diagnostic screening may also be performed for polymorphisms that are
genetically linked to a disease predisposition, particularly through the use of microsatellite
markers or single nucleotide polymorphisms. Frequently the microsatellite polymorphism
itself is not phenotypically expressed, but is linked to sequences that result in a disease
predisposition. However, in some cases the microsatellite sequence itself may affect gene
expression. Microsatellite linkage analysis may be performed alone, or in combination
with direct detection of polymorphisms, as described above. The use of microsatellite
markers for genotyping is well documented. For examples, see Mansfield et al. (1994),
Genomics 24:225-233; Ziegle et al. (1992), Genomics 14:1026-1031; Dib et al., supra.

Modulation of LGR4, LGR5 and LGR7 Gene Expression 

The LGR4, LGR5 or LGR7 genes, gene fragments, or the LGR4, LGR5 or LGR7
protein or protein fragments, are useful in gene therapy to treat disorders associated with
LGR4, LGR5 or LGR7 defects. Expression vectors may be used to introduce the LGR4,
LGR5 or LGR7 gene into a cell. Such vectors generally have convenient restriction sites
located near the promoter sequence to provide for the insertion of nucleic acid sequences.
Transcription cassettes may be prepared comprising a transcription initiation region, the
target gene or fragment thereof, and a transcriptional termination region. The
transcription cassettes may be introduced into a variety of vectors, e.g. plasmid; retrovirus,
e.g. lentivirus; adenovirus; and the like, where the vectors are able to transiently or stably
be maintained in the cells, usually for a period of at least about one day, more usually for a
period of at least about several days to several weeks.

The gene or LGR4, LGR5 or LGR7 protein may be introduced into tissues or host 
cells by any number of routes, including viral infection, microinjection, or fusion of 
vesicles. Jet injection may also be used for intramuscular administration, as described by 
Furth et al. (1992), Anal. Biochem. 205:365-368. The DNA may be coated onto gold
microparticles, and delivered intradermally by a particle bombardment device, or "gene
gun" as described in the literature (see, for example, Tang et al. (1992), Nature
356:152-154), where gold microprojectiles are coated with the LGR4, LGR5 or LGR7
DNA, then bombarded into skin cells.

Antisense molecules can be used to down-regulate expression of LGR4, LGR5, or
LGR7 in cells. The anti-sense reagent may be antisense oligonucleotides (ODN),
particularly synthetic ODN having chemical modifications from native nucleic acids, or
nucleic acid constructs that express such anti-sense molecules as RNA. The antisense
sequence is complementary to the mRNA of the targeted gene, and inhibits expression of
the targeted gene products. Antisense molecules inhibit gene expression through various
mechanisms, e.g. by reducing the amount of mRNA available for translation, through
activation of RNAse H, or steric hindrance. One or a combination of antisense molecules
may be administered, where a combination may comprise multiple different sequences.

Antisense molecules may be produced by expression of all or a part of the target 
gene sequence in an appropriate vector, where the transcriptional initiation is oriented 
such that an antisense strand is produced as an RNA molecule. Alternatively, the
antisense molecule is a synthetic oligonucleotide. Antisense oligonucleotides will
generally be at least about 7, usually at least about 12, more usually at least about 20
nucleotides in length, and not more than about 500, usually not more than about 50, more
usually not more than about 35 nucleotides in length, where the length is governed by
efficiency of inhibition, specificity, including absence of cross-reactivity, and the like. It
has been found that short oligonucleotides, of from 7 to 8 bases in length, can be strong
and selective inhibitors of gene expression (see Wagner et al. (1996), Nature Biotechnol.
14:840-844).

A specific region or regions of the endogenous sense strand mRNA sequence is 
chosen to be complemented by the antisense sequence. Selection of a specific sequence
for the oligonucleotide may use an empirical method, where several candidate sequences
are assayed for inhibition of expression of the target gene in an in vitro or animal model. 
A combination of sequences may also be used, where several regions of the mRNA 
sequence are selected for antisense complementation. 
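
The empirical selection described above begins with a set of candidate sequences. As a minimal Python sketch, assuming a hypothetical mRNA fragment and illustrative helper names (antisense_oligo, candidate_oligos) that are not part of this disclosure, every antisense DNA oligonucleotide of a chosen length may be enumerated as the reverse complement of a window of the target mRNA:

```python
# Watson-Crick partner (written as DNA) for each mRNA base.
DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_oligo(mrna_window):
    """Return the antisense DNA oligonucleotide (5'->3') for an mRNA window (5'->3')."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(mrna_window.upper()))


def candidate_oligos(mrna, length):
    """Yield (start, oligo) for every window of `length` bases along the mRNA."""
    for start in range(len(mrna) - length + 1):
        yield start, antisense_oligo(mrna[start:start + length])


# Hypothetical 24-base stretch of a target mRNA (not an LGR sequence).
target_mrna = "AUGGCCUUAGCGGAUCCGUACGAA"
candidates = list(candidate_oligos(target_mrna, 20))
for start, oligo in candidates:
    print(start, oligo)
```

Each candidate would then be assayed for inhibition of expression; the sketch only generates the sequences and does not rank them for efficiency or specificity.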

Antisense oligonucleotides may be chemically synthesized by methods known in
the art (see Wagner et al. (1993), supra, and Milligan et al., supra). Preferred
oligonucleotides are chemically modified from the native phosphodiester structure, in
order to increase their intracellular stability and binding affinity. A number of such 
modifications have been described in the literature, which alter the chemistry of the 
backbone, sugars or heterocyclic bases. 

Among useful changes in the backbone chemistry are phosphorothioates;
phosphorodithioates, where both of the non-bridging oxygens are substituted with sulfur;
phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral phosphate
derivatives include 3'-O-5'-S-phosphorothioate, 3'-S-5'-O-phosphorothioate,
3'-CH2-5'-O-phosphonate and 3'-NH-5'-O-phosphoroamidate. Peptide nucleic acids replace
the entire ribose phosphodiester backbone with a peptide linkage. Sugar modifications are
also used to enhance stability and affinity. The α-anomer of deoxyribose may be used,
where the base is inverted with respect to the natural β-anomer. The 2'-OH of the ribose
sugar may be altered to form 2'-O-methyl or 2'-O-allyl sugars, which provides resistance to
degradation without compromising affinity. Modification of the heterocyclic bases must
maintain proper base pairing. Some useful substitutions include deoxyuridine for
deoxythymidine; 5-methyl-2'-deoxycytidine and 5-bromo-2'-deoxycytidine for
deoxycytidine. 5-propynyl-2'-deoxyuridine and 5-propynyl-2'-deoxycytidine have been
shown to increase affinity and biological activity when substituted for deoxythymidine
and deoxycytidine, respectively.

As an alternative to anti-sense inhibitors, catalytic nucleic acid compounds, e.g.
ribozymes, anti-sense conjugates, etc., may be used to inhibit gene expression. Ribozymes
may be synthesized in vitro and administered to the patient, or may be encoded on an
expression vector, from which the ribozyme is synthesized in the targeted cell (for
example, see International patent application WO 9523225, and Beigelman et al. (1995),
Nucl. Acids Res. 23:4434-42). Examples of oligonucleotides with catalytic activity are
described in WO 9506764. Conjugates of anti-sense ODN with a metal complex, e.g.
terpyridylCu(II), capable of mediating mRNA hydrolysis are described in Bashkin et al.
(1995), Appl. Biochem. Biotechnol. 54:43-56.

Genetically Altered Cell or Animal Models for LGR4, LGR5 and LGR7 Function

The subject nucleic acids can be used to generate transgenic, non-human animals 
or site specific gene modifications in cell lines. Transgenic animals may be made through 
homologous recombination, where the normal LGR4, LGR5 or LGR7 locus is altered. 
Alternatively, a nucleic acid construct is randomly integrated into the genome. Vectors
for stable integration include plasmids, retroviruses and other animal viruses, YACs, and
the like.

The modified cells or animals are useful in the study of LGR4, LGR5 and/or LGR7
function and regulation. For example, a series of small deletions and/or substitutions may
be made in the host's native LGR4, LGR5 or LGR7 gene to determine the role of different
exons. Of interest is the use of LGR4, LGR5 or LGR7 to construct transgenic animal
models for disease states. Specific constructs of interest include anti-sense LGR4, LGR5
or LGR7, which will block LGR4, LGR5 or LGR7 expression; expression of dominant
negative LGR4, LGR5 or LGR7 mutations; and over-expression of LGR4, LGR5 or LGR7
genes. Where an LGR4, LGR5 or LGR7 sequence is introduced, the introduced sequence
may be either a complete or partial sequence of an LGR4, LGR5 or LGR7 gene native to
the host, or may be a complete or partial LGR4, LGR5 or LGR7 sequence that is
exogenous to the host animal, e.g., a human LGR4, LGR5 or LGR7 sequence. A
detectable marker, such as lac Z, may be introduced into the LGR4, LGR5 or LGR7 locus,
where upregulation of LGR4, LGR5 or LGR7 expression will result in an easily detected
change in phenotype.

One may also provide for expression of the LGR4, LGR5 or LGR7 gene or variants
thereof in cells or tissues where it is not normally expressed, at levels not normally present
in such cells or tissues, or at abnormal times of development. By providing expression of
LGR4, LGR5 or LGR7 protein in cells in which it is not normally produced, one can
induce changes in cell behavior, e.g. through LGR4, LGR5 or LGR7 mediated activity.

DNA constructs for homologous recombination will comprise at least a portion of 
the LGR4, LGR5 or LGR7 gene, which may or may not be native to the species of the
host animal, wherein the gene has the desired genetic modification(s), and includes
regions of homology to the target locus. DNA constructs for random integration need not
include regions of homology to mediate recombination. Conveniently, markers for
positive and negative selection are included. Methods for generating cells having targeted
gene modifications through homologous recombination are known in the art. For various
techniques for transfecting mammalian cells, see Keown et al. (1990), Meth. Enzymol.
185:527-537.

For embryonic stem (ES) cells, an ES cell line may be employed, or embryonic 
cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig, etc. Such cells are 
grown on an appropriate fibroblast-feeder layer or grown in the presence of leukemia
inhibiting factor (LIF). When ES or embryonic cells have been transformed, they may be
used to produce transgenic animals. After transformation, the cells are plated onto a
feeder layer in an appropriate medium. Cells containing the construct may be detected by
employing a selective medium. After sufficient time for colonies to grow, they are picked
and analyzed for the occurrence of homologous recombination or integration of the
construct. Those colonies that are positive may then be used for embryo manipulation and
blastocyst injection. Blastocysts are obtained from 4 to 6 week old superovulated females.
The ES cells are trypsinized, and the modified cells are injected into the blastocoel of the
blastocyst. After injection, the blastocysts are returned to each uterine horn of
pseudopregnant females. Females are then allowed to go to term and the resulting
offspring screened for the construct. By providing for a different phenotype of the
blastocyst and the genetically modified cells, chimeric progeny can be readily detected.

The chimeric animals are screened for the presence of the modified gene and males 
and females having the modification are mated to produce homozygous progeny. If the
gene alterations cause lethality at some point in development, tissues or organs can be
maintained as allogeneic or congenic grafts or transplants, or in in vitro culture. The
transgenic animals may be any non-human mammal, such as laboratory animals, domestic
animals, etc. The transgenic animals may be used in functional studies, drug screening,
etc., e.g. to determine the effect of a candidate drug on LGR4, LGR5 or LGR7 or related
gene activation, etc.

In vitro models for LGR4, LGR5 or LGR7 Function

The availability of a number of components in the G-protein coupled receptor
family, as previously described, allows in vitro reconstruction of the processes or systems
in which members of this family operate. Two or more of the components, such as the
isolated receptor and a potential ligand therefor, may be combined in vitro, and the
behavior assessed in terms of activation of transcription of specific target sequences;
modification of protein components, e.g. proteolytic processing, phosphorylation,
methylation, etc.; ability of different protein components to bind to each other. The
components may be modified by sequence deletion, substitution, etc. to determine the
functional role of specific domains.

Drug screening may be performed using an in vitro model, a genetically altered
cell or animal, purified LGR4, LGR5 or LGR7 protein, as well as fragments or portions
thereof, e.g. solubilized extra-cellular domain or chimeric receptor proteins comprising the
LGR4, LGR5 or LGR7 extra-cellular domain. One can identify ligands or substrates that
bind to and modulate the action of LGR4, LGR5 or LGR7. Areas of investigation include
the development of agents that beneficially counter abnormalities related to LGR4, LGR5
or LGR7 and the use of such agents in therapy.

Drug screening identifies agents that modulate LGR4, LGR5 or LGR7 function in
abnormal cells. Of particular interest are screening assays for agents
that have a low toxicity for human cells. A wide variety of assays may be used for this
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays for protein binding, and the like. The purified protein may 
also be used for determination of three-dimensional crystal structure, which can be used 
for modeling intermolecular interactions, such as GTP binding, etc. 

The term "agent" as used herein describes any molecule, e.g. protein or
pharmaceutical, with the capability of altering or mimicking the physiological function of
LGR4, LGR5 or LGR7. Generally a plurality of assay mixtures are run in parallel with
different agent concentrations to obtain a differential response to the various
concentrations. Typically, one of these concentrations serves as a negative control, i.e. at
zero concentration or below the level of detection.
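
A parallel set of assay mixtures of this kind can be laid out as a dilution series that ends in a zero-agent control. The following minimal Python sketch assumes a hypothetical top concentration of 10 micromolar and 10-fold steps; neither value, nor the helper name dilution_series, comes from this disclosure.

```python
def dilution_series(top_concentration, fold, n_points):
    """Return concentrations from highest to lowest, ending with a zero-agent control."""
    series = [top_concentration / fold ** i for i in range(n_points)]
    return series + [0.0]  # the final mixture is the negative control


# Hypothetical layout: five 10-fold steps starting at 10 micromolar, plus a control.
concentrations_uM = dilution_series(10.0, 10, 5)
for well, conc in enumerate(concentrations_uM, start=1):
    label = "negative control" if conc == 0 else f"{conc:g} uM"
    print(f"mixture {well}: {label}")
```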

In some embodiments, candidate agents encompass numerous chemical classes, 
though typically they are organic molecules, preferably small organic compounds having a 
molecular weight of more than 50 and less than about 2,500 daltons. Candidate agents 
comprise functional groups necessary for structural interaction with proteins, particularly
hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl
group, preferably at least two of the functional chemical groups. The candidate agents
often comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic
structures substituted with one or more of the above functional groups. Candidate agents
are also found among biomolecules including peptides, saccharides, fatty acids, steroids,
purines, pyrimidines, derivatives, structural analogs or combinations thereof.

WO 99/48921 PCT/US99/06573

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random 
and directed synthesis of a wide variety of organic compounds and biomolecules, 
including expression of randomized oligonucleotides and oligopeptides. Alternatively, 
libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts
are available or readily produced. Additionally, natural or synthetically produced libraries
and compounds are readily modified through conventional chemical, physical and
biochemical means, and may be used to produce combinatorial libraries. Known
pharmacological agents may be subjected to directed or random chemical modifications,
such as acylation, alkylation, esterification, amidification, etc., to produce structural
analogs. 

Of particular interest in certain embodiments are peptidic agents based on LGR4,
LGR5 or LGR7, e.g. solubilized extra-cellular domain or chimeric receptor proteins
comprising the LGR4, LGR5 or LGR7 extra-cellular domain, where such agents
neutralize the activity of endogenous LGR4, LGR5 or LGR7 ligands, e.g. hormones.

Where the screening assay is a binding assay, one or more of the molecules may be 
joined to a label, where the label can directly or indirectly provide a detectable signal. 
Various labels include radioisotopes, fluorescers, chemiluminescers, enzymes, specific 
binding molecules, particles, e.g. magnetic particles, and the like. Specific binding
molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin, etc. For
the specific binding members, the complementary member would normally be labeled 
with a molecule that provides for detection, in accordance with known procedures. 

A variety of other reagents may be included in the screening assay. These include 
reagents like salts, neutral proteins, e.g. albumin, detergents, etc., that are used to facilitate
optimal protein-protein binding and/or reduce non-specific or background interactions.
Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease
inhibitors, anti-microbial agents, etc., may be used. The mixture of components is added
in any order that provides for the requisite binding. Incubations are performed at any
suitable temperature, typically between 4 and 40°C. Incubation periods are selected for
optimum activity, but may also be optimized to facilitate rapid high-throughput screening.
Typically between 0.1 and 1 hours will be sufficient.

Other assays of interest detect agents that mimic LGR4, LGR5 or LGR7 function.
For example, an expression construct comprising an LGR4, LGR5 or LGR7 gene may be
introduced into a cell line under conditions that allow expression. The level of LGR4,
LGR5 or LGR7 activity is determined by a functional assay, as previously described. In
one screening assay, the ability of candidate agents to inhibit or enhance LGR4, LGR5 or
LGR7 function is determined. Alternatively, candidate agents are added to a cell that
lacks functional LGR4, LGR5 or LGR7, and screened for the ability to reproduce LGR4,
LGR5 or LGR7 activity in a functional assay.

The compounds having the desired pharmacological activity may be administered
in a physiologically acceptable carrier to a host for treatment, etc. The compounds may
also be used to enhance LGR4, LGR5 or LGR7 function. The inhibitory agents may be
administered in a variety of ways, orally, topically, parenterally e.g. subcutaneously,
intraperitoneally, by viral infection, intravascularly, etc. Topical treatments are of
particular interest. Depending upon the manner of introduction, the compounds may be
formulated in a variety of ways. The concentration of therapeutically active compound in
the formulation may vary from about 0.1-100 wt.%.

The pharmaceutical compositions can be prepared in various forms, such as 
granules, tablets, pills, suppositories, capsules, suspensions, salves, lotions and the like. 
Pharmaceutical grade organic or inorganic carriers and/or diluents suitable for oral and
topical use can be used to make up compositions containing the therapeutically-active
compounds. Diluents known to the art include aqueous media, vegetable and animal oils 
and fats. Stabilizing agents, wetting and emulsifying agents, salts for varying the osmotic 
pressure or buffers for securing an adequate pH value, and skin penetration enhancers can 
be used as auxiliary agents. 

Experimental 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is regarded as the invention. 
Efforts have been made to ensure accuracy with respect to the numbers used (e.g.
amounts, temperature, concentrations, etc.) but some experimental errors and deviations
should be allowed for. Unless otherwise indicated, parts are parts by weight, molecular
weight is average molecular weight, temperature is in degrees centigrade, and pressure is
at or near atmospheric.

Example 1. Identification of LGR4 and LGR5

Human sequences related to the sea anemone and Drosophila glycoprotein
hormone receptors were identified from the expressed sequence tag database (dbEST) at
the National Center for Biotechnology Information by using the BLAST server with the
BLOSUM62 protein comparison matrix (Altschul SF et al., Nucleic Acids Res. (1997)
25:3389-3402). Human ESTs showing high homology to two non-overlapping regions of
the gonadotropin receptors were identified. Clones AA312798 and AA298810 were
found to encode transmembrane regions four to five of the putative receptor LGR4, whereas
AA460529 and AA424098 encode transmembrane regions two to three of the putative
receptor LGR5. Using these ESTs to further search the GenBank EST division database,
overlapping EST sequences were aligned to obtain the longest open reading frame (ORF)
for these receptors.
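
Once overlapping ESTs have been merged into a contig, locating the longest ORF is a simple scan. The following minimal Python sketch, using a hypothetical contig and a hypothetical helper name (longest_orf), searches the three forward reading frames for the longest stretch from an ATG to the first in-frame stop codon. It is an illustration only, not the procedure actually used for LGR4 and LGR5, and it does not examine the reverse strand.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(dna):
    """Return (start, end, orf) for the longest ATG-to-stop ORF on the forward strand.

    `end` is exclusive and the returned ORF includes its stop codon.
    Returns None if no complete ORF is found.
    """
    dna = dna.upper()
    best = None
    for frame in range(3):
        orf_start = None
        for pos in range(frame, len(dna) - 2, 3):
            codon = dna[pos:pos + 3]
            if orf_start is None and codon == "ATG":
                orf_start = pos  # first start codon opens the ORF
            elif orf_start is not None and codon in STOP_CODONS:
                candidate = (orf_start, pos + 3, dna[orf_start:pos + 3])
                if best is None or len(candidate[2]) > len(best[2]):
                    best = candidate
                orf_start = None  # look for a further ORF in this frame
    return best


# Hypothetical assembled contig (not an LGR sequence).
contig = "CCATGAAATGACCATGGCTGCTAAACCCGGGTAGTT"
result = longest_orf(contig)
print(result)
```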

Based on the longest human ORF, specific primers were designed for PCR
amplification of LGR4 and LGR5 cDNA fragments from rat ovary and human placenta,
respectively. After hybridization with labeled EST clones and confirmation of DNA
sequences by dideoxy DNA sequencing, specific receptor fragments isolated were used to
design primers to prepare sub-cDNA libraries enriched with specific receptor cDNAs. For
5' extension, reverse transcription was performed using rat ovarian and human placenta
mRNA preparations and receptor-specific primers. Following second strand synthesis, the
enriched cDNA pool was tailed at 5'-ends with specific adaptor sequences to allow further
PCR amplification. For 3' extension, rat ovarian or human placenta mRNAs were
reverse transcribed using oligo-dT, followed by second strand synthesis using
receptor-specific primers and adaptor tailing. These mini-libraries were further used as
templates for PCR amplification of upstream or downstream cDNAs specific for each
receptor using internal primers. PCR products with a strong hybridization signal to each
receptor cDNA fragment were subcloned into the pUC18 or pcDNA3 vectors. After
screening of these sublibraries based on colony hybridization using specific receptor
probes, clones with 5'- or 3'-sequences of the putative receptors were identified and
isolated for DNA sequencing. As needed, the procedure was repeated up to three times to
generate cDNAs encoding the complete ORF of each putative receptor for sequence
analysis and for the expression of receptor proteins in eukaryotic cells. The entire coding
sequences of each gene were also amplified with specific primers flanking the entire ORF
in independent experiments. At least three independent PCR clones were sequenced to
verify the authenticity of coding sequences. The nucleotide sequence of LGR4, as well as
the amino acid sequence of the product encoded by the ORF thereof, is provided in Fig. 1.
The nucleotide sequence of LGR5, as well as the amino acid sequence of the product
encoded by the ORF thereof, is provided in Fig. 2.

Example 2. Comparison of deduced amino acid sequence of LGR4 and 5 cDNAs 
and those encoding FSH and LH receptors. 

Sequence alignment of LGR4 and LGR5 with known human glycoprotein
hormone receptors was performed and the results are shown in Fig. 6. Shaded residues are
identical in at least two of the four receptor proteins shown. 
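
The shading rule can be stated precisely with a short Python sketch. The four aligned fragments below are hypothetical placeholders, not the receptor sequences of the figure, and the helper name shaded_mask is illustrative. A residue is marked when at least two sequences carry the same residue in that alignment column.

```python
from collections import Counter


def shaded_mask(aligned, min_shared=2):
    """Mark ('#') residues shared by at least `min_shared` sequences in a column.

    `aligned` is a list of equal-length, pre-aligned sequences; gaps ('-') are
    never marked. Returns one mask string per input sequence.
    """
    masks = [[] for _ in aligned]
    for column in zip(*aligned):
        counts = Counter(res for res in column if res != "-")
        for mask, res in zip(masks, column):
            mask.append("#" if counts.get(res, 0) >= min_shared else ".")
    return ["".join(mask) for mask in masks]


# Hypothetical pre-aligned fragments (equal length, '-' marks a gap).
alignment = [
    "MLC-GK",
    "MLCSGR",
    "MVA-GK",
    "QVASTR",
]
for seq, mask in zip(alignment, shaded_mask(alignment)):
    print(seq, mask)
```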

Example 3. Expression pattern of LGR4 and 5 mRNA transcripts in different tissues.

For northern blot analysis, poly(A)+-selected RNA from different human tissues
was hybridized with 32P-labeled cDNA probes. After washing, the blots were exposed to
X-ray films at -70°C for five days. Subsequent hybridization with a beta-actin cDNA
probe was performed to estimate nucleic acid loading (8 h exposure). LGR4 was shown to
be expressed in placenta, ovary, testis, adrenal, spinal cord, thyroid, stomach, trachea,
heart, pancreas, kidney, prostate and spleen, while LGR5 was shown to be expressed in
skeletal muscle, placenta, spinal cord, brain, adrenal, colon, stomach, ovary and bone
marrow.

Example 4. Chromosomal localization of LGR4 and 5 in human. 

Using genomic fragments of LGR4 (>100 Kb) and LGR5 (>100 Kb) as probes, the
chromosomal localization of these genes was determined, using the FISH method on
banded chromosomes, to be 5q34-35.1 and 12q15, respectively.

Example 5. Identification of LGR7. 

Analysis of EST databases has revealed a novel LGR closely related to a G protein-coupled
receptor from pond snail (Lymnaea stagnalis, accession no. 481946). Because the
snail G-protein coupled receptor shared the leucine-rich repeat ectodomain and seven
transmembrane region characteristics of mammalian LGRs, the novel EST sequence could
encode either a homologue of the snail receptor or a novel mammalian LGR. For the isolation of
LGR7 cDNA, a Clontech Marathon-ready testis cDNA pool was used as the template for 5'
and 3' RACE with adapter and gene-specific primers. Sequence analysis of the RACE
products showed that the LGR7 gene encodes at least two splice variants that differ at the
N-terminus. The nucleotide sequence of the long variant, as well as the amino acid sequence of
the product encoded by the ORF thereof, is provided in Fig. 3; while the nucleotide sequence
of the short variant, as well as the amino acid sequence of the ORF thereof, is provided in Fig.
4. Both variants contain a classical C-terminal 7-transmembrane region and a leucine-rich
repeat ectodomain flanked by cysteine rich regions found in other mammalian LGRs. The
long form LGR7 contains an extra 35 amino acids in the N-terminal cysteine rich region as
compared to the short form LGR7. Of interest, analysis of the LGR7 ORF from either variant
showed that its tertiary structure resembles that of mammalian LGRs instead of the snail
receptor, which shares the greatest identity in the transmembrane region. These findings
suggest that LGR7 and the snail receptor diverged early during evolution and LGR7 perhaps
adopted a new function in higher organisms.

Based on the LGR7 cDNA sequence, we further identified a human genomic DNA
fragment (AQ053279) in the genomic survey sequence division of GenBank that contains
part of the LGR7 gene. The authenticity of this genomic clone was confirmed by Southern
blot hybridization and the genomic clone was used as the probe to identify the
chromosomal localization for the LGR7 gene.

It is evident from the above discussion and results that three novel mammalian
G-protein coupled receptors, as well as nucleic acids encoding the same, are provided by
the subject invention. The inventions described above find use in a variety of applications,
including research and therapeutic applications.

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The publications 
discussed herein are provided solely for their disclosure prior to the filing date of the 
present application. Nothing herein is to be construed as an admission that the invention 
is not entitled to antedate such a disclosure by virtue of prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily 
apparent to those of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit
or scope of the appended claims.

What is claimed is:

1. An isolated nucleic acid encoding a mammalian protein selected from the 
group consisting of LGR4, LGR5 or LGR7. 

2. An isolated nucleic acid according to Claim 1, wherein said mammalian
protein has the amino acid sequence of SEQ ID NO:02, SEQ ID NO:04, SEQ ID NO:06 or
SEQ ID NO:08.

3. An isolated nucleic acid according to Claim 1, wherein said mammalian
protein has an amino acid sequence that is substantially identical to the amino acid
sequence of SEQ ID NO:02, SEQ ID NO:04, SEQ ID NO:06 or SEQ ID NO:08.

4. An isolated nucleic acid according to Claim 1, wherein the nucleotide
sequence of said nucleic acid has the sequence selected from the group consisting of: (a)
SEQ ID NO:01 or the complementary sequence thereof; (b) SEQ ID NO:03 or the
complementary sequence thereof; (c) SEQ ID NO:05 or the complementary sequence
thereof; and (d) SEQ ID NO:07 or the complementary sequence thereof.

5. An isolated nucleic acid comprising at least 18 contiguous nucleotides of
the sequence selected from the group consisting of: (a) SEQ ID NO:01 or the
complementary sequence thereof; (b) SEQ ID NO:03 or the complementary sequence
thereof; (c) SEQ ID NO:05 or the complementary sequence thereof; and (d) SEQ ID
NO:07 or the complementary sequence thereof.

25 

6. An isolated nucleic acid comprising at least SO contiguous nucleotides of 
the sequence selected from the group consisting of: (a) SEQ ID N0:1 or the 
complementary sequence thereof; (b) SEQ ID NO:03 or the complementary sequence 
thereof; (c) SEQ ID NO:05 or the complementary sequence thereof; and (d) SEQ ID 
30 NO:07 or the complementary sequence thereof. 

7. An isolated nucleic acid that hybridizes under stringent conditions to a 
nucleic acid having the nucleotide sequence selected from the group consisting of: (a) 
SEQ ID NO:01 or the complementary sequence thereof; (b) SEQ ID NO:03 or the 
complementary sequence thereof; (c) SEQ ID NO:05 or the complementary sequence 
thereof; and (d) SEQ ID NO:07 or the complementary sequence thereof. 

8. An expression cassette comprising a transcriptional initiation region 
functional in an expression host, a nucleic acid having a sequence of the isolated nucleic 
acid according to Claim 1 under the transcriptional regulation of said transcriptional 
initiation region, and a transcriptional termination region functional in said expression 
host. 

9. A cell comprising an expression cassette according to Claim 8 as part of an 
extrachromosomal element or integrated into the genome of a host cell as a result of 
introduction of said expression cassette into said host cell, and the cellular progeny of said 
host cell. 

10. A method for producing a mammalian protein selected from the group 
consisting of LGR4, LGR5 and LGR7, said method comprising: 

growing a cell according to Claim 9, whereby said mammalian protein is 
expressed; and 

isolating said protein substantially free of other proteins. 

11. A purified polypeptide composition comprising at least 50 weight % of the 
protein present as a mammalian protein selected from the group consisting of LGR4, 
LGR5 and LGR7, or a fragment thereof. 

12. An antibody binding specifically to a mammalian protein selected from the 
group consisting of LGR4, LGR5 and LGR7. 

13. The antibody of Claim 12, wherein said antibody is a monoclonal antibody. 

14. A non-human transgenic animal model for LGR4, LGR5 or LGR7 gene 
function, wherein said transgenic animal comprises an introduced alteration in an LGR4, 
LGR5 or LGR7 gene. 

15. The animal model of claim 14, wherein said animal is heterozygous for 
said introduced alteration. 

16. The animal model of claim 14, wherein said animal is homozygous for said 
introduced alteration. 

17. The animal model of claim 14, wherein said introduced alteration is a 
knockout of endogenous LGR4, LGR5 or LGR7 gene expression. 

18. A method of screening a sample for the presence of a ligand for a receptor 
selected from the group consisting of LGR4, LGR5 and LGR7, said method comprising: 

contacting said sample with a receptor selected from the group consisting 
of LGR4, LGR5 and LGR7 or a mimetic thereof; and 

detecting the presence of a binding event between said receptor and ligand 
in said sample. 
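
The contiguous-nucleotide criterion recited in Claims 5 and 6 can be illustrated with a short sketch. It is not part of the claimed subject matter. It assumes that "the complementary sequence thereof" is read as the reverse complement, and the function names are hypothetical; the reference string is the first 60 nucleotides of SEQ ID NO:01 as printed in the sequence figure.

```python
# Illustrative sketch only: test whether a candidate nucleic acid contains at
# least k contiguous nucleotides of a reference sequence or of its complement.
# Assumption: "complementary sequence" is taken to mean the reverse complement.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def shares_contiguous_run(candidate: str, reference: str, k: int = 18) -> bool:
    """True if any k-length window of candidate occurs in reference or its reverse complement."""
    candidate = candidate.upper()
    reference = reference.upper()
    if len(candidate) < k:
        return False
    targets = (reference, reverse_complement(reference))
    for start in range(len(candidate) - k + 1):
        window = candidate[start:start + k]
        if any(window in target for target in targets):
            return True
    return False


# First 60 nucleotides of SEQ ID NO:01 (LGR4), used here as a small reference.
REFERENCE = "ATGCCGGGCCCGCTAGGGCTGCTCTGCTTCCTCGCCCTGGGGCTGCTCGGCTCGGCCGGG"
probe = REFERENCE[10:30]  # a 20-nucleotide fragment of the reference
```

Calling `shares_contiguous_run(probe, REFERENCE)` checks the 18-nucleotide threshold of Claim 5; passing `k=50` corresponds to the threshold of Claim 6.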

>LGR4 nucleotide sequence (SEQ ID NO: 01) 

ATGCCGGGCCCGCTAGGGCTGCTCTGCTTCCTCGCCCTGGGGCTGCTCGGCTCGGCCGGGCCCAGCGGCGCGGCGCCGCCT 
CTCTGCGCGGCGCCCTGCAGCTGCGACGGCGACCGTCGGGTGGACTGCTCCGGAAAGGGGTTGACGGCCGTACCGGAGGGT 
CTCAGCGCCTTCACCCAAGCACTGGATATCAGTATGAACAATATCACCCAGTTACCAGAAGATGCATTTAAGAGTTTCCCA 
TTTCTAGAGGAGCTACAACTGGCTGGTAACGACCTTTCTCTTATCCATCCAAAAGCCTTGTCTGGGCTGAAAGAACTCAAA 
GTCCTAACACTCCAGAATAATCAGTTGAGAACAGTGCCCAGTGAAGCCATTCACGGACTGAGTGCTTTGCAGTCTTTACGC 
TTAGATGCCAACCATATTACCTCAGTCCCGGAGGACAGTTTTGAAGGGCTTGTCCAGTTACGCCATCTGTGGCTGGATGAC 
AACAGCTTGACGGAAGTGCCCGTGCGTCCCCTCAGCAACCTGCCAACCCTGCAGGCGCTGACCTTGGCTCTCAACAACATC 
TCAAGCATCCCTGACTTCGCTTTCACCAACCTTTCAAGCTTGGTGGTTCTGCATCTGCATAACAATAAAATTAAAAGCCTC 
AGTCAACACTGTTTTGATGGACTAGATAACCTGGAAACCTTGGACTTGAATTACAATTACTTGGATGAGTTTCCTCAGGCT 
ATTAAAGCCCTTCCCAGCCTTAAAGAGCTGGGATTTCACAGTAATTCTATTTCTGTTATTCCTGATGGAGCATTTGGTGGT 
AATCCACTGCTAAGAACTATTCATTTGTATGATAATCCTCTGTCTTTTGTGGGGAACTCAGCATTTCACAACCTGTCTGAT 
CTGCATTGCTTAGTCATTCGTGGTGCAAGCCTGGTGCAGTGGTTCCCCAATCTGACCGGAACTGTCCATTTGGAGAGTCTA 
ACCTTGACAGGGACAAAAATAAGCAGCATACCTGATGATCTGTGCCAAAACCAAAAGATGCTGAGGACTCTGGACTTATCT 
TATAACAATATAAGAGACCTTCCAAGTTTTAATGGTTGTCGTGCATTGGAAGAAATTTCATTGCAGCGTAATCAAATCTCC 
CTAATAAAGGAAAATACTTTTCAAGGCCTAACATCTCTAAGGATTCTAGATCTGAGTAGAAACCTGATCCGTGAAATTCAC 
AGTGGAGCTTTTGCGAAGCTTGGGACAATTACTAACCTGGATGTAAGTTTCAATGAATTAACTTCATTTCCTACGGAAGGC 
CTAAATGGGCTCAATCAACTAAAGCTTGTGGGTAACTTCAAGCTGAAAGACGCCTTGGCAGCCAGAGACTTTGCTAATCTC 
AGGTCTCTATCAGTACCATATGCTTATCAGTGTTGTGCATTTTGGGGGTGTGACTCTTTATGCAAATTAAACACAGAAGAT 
AACAGCCCCCAAGAACACAGTGTGACAAAAGAGAAAGGTGCTACAGATGCAGCAAATGTCACCAGCACTGCTGAGAACGAA 
GAACATAGCCAAATAATTATCCACTGTACACCTTCAACAGGTGCTTTCAAGCCCTGTGAATATTTACTGGGAAGCTGGATG 
ATTCGCCTTACAGTGTGGTTCATTTTCCTGGTCGCCTTGCTTTTCAACCTGCTTGTCATTTTAACAGTGTTTGCGTCTTGT 
TCATCACTGCCTGCCTCCAAACTCTTCATAGGCTTGATTTCTGTGTCTAACTTACTCATGGGCATCTATACTGGCATCCTT 
ACTTTTCTTGATGCTGTGTCCTGGGGCCGATTTGCCGAATTTGGCATTTGGTGGGAAACTGGCAGCGGCTGCAAGGTAGCC 
GGGTCTCTGGCAGTCTTCTCCTCAGAGAGCGCTGTATTCCTATTAACACTGGCAGCTGTGGAAAGAAGCGTATTTGCAAAG 
GATTTGATGAAACACGGGAAGAGCAGTCACCTCAGACAGTTCCAGGTGGCCGCCCTCTTAGCTTTGCTGGGTGCCGCAGTG 
GCAGGCTGCTTCCCCCTTTTCCACGGAGGGCAATATTCTGCATCGCCCTTGTGTTTGCCGTTTCCTACAGGAGAAACCCCA 
TCGTTAGGATTCACTGTGACCTTAGTGCTATTAAACTCACTGGCATTTTTACTAATGGCCATTATCTACACTAAACTATAC 
TGCAACTTAGAGAAGGAGGACCTGTCGGAAAACTCCCAGTCTAGCGTGATTAAGCACGTTGCCTGGCTCATCTTCACAAAC 
TGCATCTTCTTCTGCCCTGTTGCATTTTTCTCATTTGCACCATTGATCACGGCAATCTCCATCAGCCCCGAGATAATGAAG 
TCTGTTACACTGATATTCTTCCCGTTGCCTGCTTGCCTGAATCCGGTCCTGTATGTTTTCTTCAACCCAAAGTTTAAAGAA 
GACTGGAAGCTACTGAAGCGGCGTGTTACCAGGAAACACGGATCTGTTTCAGTTTCCATCAGCAGCCAAGGCGGTTGTGGG 
GAACAGGATTTCTACTATGACTGTGGCATGTATTCCCACTTGCAGGGTAACCTGACTGTCTGTGACTGCTGTGAGTCATTT 
CTTTTGACAAAACCAGTATCATGCAAACACTTAATAAAATCGCACAGTTGTCCTGTATTGACAGCGGCCTCTTGCCAGAGG 
CCAGAGGCCTACTGGTCTGATTGTGGTACACAGTCAGCCCATTCTGACTATGCAGATGAAGAAGATTCCTTTGTCTCAGAC 
AGCTCTGACCAGGTGCAGGCCTGTGGACGAGCCTGCTTCTACCAGAGTCGTGGATTCCCTCTGGTGCGCTATGCTTACAAT 
CTACAGAGAGTCAGAGACTGA 



>LGR4 amino acid sequence (SEQ ID NO: 02) 

MPGPLGLLCFLALGLLGSAGPSGAAPPLCAAPCSCDGDRRVDCSGKGLTAVPEGLSAFTQALDISMNNITQLPEDAFKSFP 
FLEELQLAGNDLSLIHPKALSGLKELKVLTLQNNQLRTVPSEAIHGLSALQSLRLDANHITSVPEDSFEGLVQLRHLWLDD 
NSLTEVPVRPLSNLPTLQALTLALNNISSIPDFAFTNLSSLVVLHLHNNKIKSLSQHCFDGLDNLETLDLNYNYLDEFPQA 
IKALPSLKELGFHSNSISVIPDGAFGGNPLLRTIHLYDNPLSFVGNSAFHNLSDLHCLVIRGASLVQWFPNLTGTVHLESL 
TLTGTKISSIPDDLCQNQKMLRTLDLSYNNIRDLPSFNGCRALEEISLQRNQISLIKENTFQGLTSLRILDLSRNLIREIH 
SGAFAKLGTITNLDVSFNELTSFPTEGLNGLNQLKLVGNFKLKDALAARDFANLRSLSVPYAYQCCAFWGCDSLCKLNTED 
NSPQEHSVTKEKGATDAANVTSTAENEEHSQIIIHCTPSTGAFKPCEYLLGSWMIRLTVWFIFLVALLFNLLVILTVFASC 
SSLPASKLFIGLISVSNLLMGIYTGILTFLDAVSWGRFAEFGIWWETGSGCKVAGSLAVFSSESAVFLLTLAAVERSVFAK 
DLMKHGKSSHLRQFQVAALLALLGAAVAGCFPLFHGGQYSASPLCLPFPTGETPSLGFTVTLVLLNSLAFLLMAIIYTKLY 
CNLEKEDLSENSQSSVIKHVAWLIFTNCIFFCPVAFFSFAPLITAISISPEIMKSVTLIFFPLPACLNPVLYVFFNPKFKE 
DWKLLKRRVTRKHGSVSVSISSQGGCGEQDFYYDCGMYSHLQGNLTVCDCCESFLLTKPVSCKHLIKSHSCPVLTAASCQR 
PEAYWSDCGTQSAHSDYADEEDSFVSDSSDQVQACGRACFYQSRGFPLVRYAYNLQRVRD 
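
The correspondence between SEQ ID NO:01 and SEQ ID NO:02 can be checked by translating the open reading frame. The following is a minimal sketch, assuming the standard genetic code and reading from the first nucleotide; the names `CODON_TABLE` and `translate` are illustrative and do not come from the specification.

```python
from itertools import product

# Standard genetic code, codons enumerated with the bases in T, C, A, G order
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nucleotides: str) -> str:
    """Translate a DNA string in frame 1; unrecognized codons become 'X'."""
    seq = "".join(nucleotides.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# The first 21 nucleotides of SEQ ID NO:01 give the first 7 residues of SEQ ID NO:02.
first_residues = translate("ATGCCGGGCCCGCTAGGGCTG")
```

Codons containing characters other than A, C, G or T (for example, positions damaged during text extraction) are reported as `X` rather than guessed.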

>Nucleotide sequence of LGR5 (total 2082 nucleotides) (SEQ ID NO:03) 

CTACATCTCCATAAOUlTAGAATCCACTCCCnSGGAAAGAAATGCTTTGA^ 

AATTAOU^TAACCTTGATGAATTCCCCACTGaATTAGGACACTCTCC^ 

ATCyiGGTCGATACCTGAGAAAGCATTTGTAGGCAACCCTTCrCTTATTACAATACATTO 

GTTGGGAGATCTGCTTTTCAAa^TTTACCTGAACTAAGAACACTGACTCTO 

GATTTAACTGGAACTGCAAACCTGGAGAGTCTGACITTAACTGGAGCACAGATCTCATCTC^ 

CAGTTACCTAATCTCCAAGTGCTAGATCTGTCTTACAACCTATTAGAAGATTTACCCAGTT^^ 

CAGAAAATTGACCTAAGACATAATGAAATCTACGAAATTAAAGTTGACACITTCCA 

AATTTGGCTTGGAAaUUUVTTGCrATTATTCACCCCAATGOlT^ 

TCCAACCTCCroTCGTCTTTTCCTATAACTGGGTTA»TGGTTTAACTCA 

AGCTGGATATCATCTGAAAACTTTCCMAACTCAAGGTXATAGAAATQCCITATGCTO 

TGTGAGAATGCCTATAAGATTTCTAATCAATGGAATAAAGGTGACAACAGCAGTATGGAC6ACCCT 

GGAATGTTTCAGGCTCAAGATGAACGTGACCTTGAAGATTTCCTGCTTGACTTTGAGGAAGACCT 

GTGCAGTGTTCACCTTCCCCAGGCCCCTTCAAACCCTGTGAACACCTGCTTGATGGCTGGCTGAT^ 

ACCATAGCAGTTCTGGCACTTACTTGTAATGCTTTGGTGACTTCAACT^GTTTTC:^ 

AAACTGTTAATTGGGGTCATCGCAGCAGTGAACATGCTCACGGGAGTCTCCAGTGCCGTGCTGGCrrGGTGTGGATGCG^ 

ACTTTTGGCAGCirrGCACGACATGGTGCCTGGTGGGAGAATGGGGTTGGTTGC^ 

GCTTCAGAATCATCTGTTTTCCTGCTTACTCTGGCAGCCCTGGAGCGTGGGTTCT 

ACGAAAGCTCCATTTTCTAGCCnSAAAGTAATCaiTTTTGCTCTGTGC^ 

CTGGGTGGCAGCAAGTATGGCGCCTCCCCTCTCTGCCrGCCTTTGCCITTT^ 

GCTCTCATCTTGCTCAATTCCCTTTGCTTCCTCATGATGACCATTGCCTACy^CCAAGCTCTACTGC^ 

GACCTGGAGAATATTTGGGACTGCTCTATGGTAAAAOVCATTGCCCTGTTGCTCTTCACCAAC^ 

GTGGCTTTCTTGTCCTTCTCCTCTTTAATAAACCrrACATTTATCAGTCCT^ 

GTCCCACTTCCTGOVTGTCTCAATCCCCTTCTCTACATCTTGTrCAATCCTCAC^^ 

AAGCAAACCTACGTCTGGACAAGATOVAAACACCCAAGCTTGATGTauVTTAACT 

GACTCAACTCAAGCCTTGGTAACCTTTACCAGCTCCAGCATCACTTATGACCTGCCTCCCAGTTCCGTGCCATCACC^ 
TATCCAGTGACTGAGAGCroCCATCrTTCCTCrGTGGaiTTTGTCCCATGTCT^ 



>Amino acid sequence of LGR5 (total 693 amino acids) (SEQ ID NO:04) 

IJIIJINITOIHSLGKKCFDGLHSLETLDLNYNNLDEFPTAIRTLSNL^ 

VGRSAFQHLPEUlTLTLNGASQITEFPDLTGTANLESLTLTGAQISSLPQTVCNQLPNliQVLDLSVN^^ 
QKIDLRHNEIYEIKVDTFQQLLSLRSLNIiAWNKIAIIHPNAFSTLPSLIKIJ)LSSNLI^ 

SWISSENFPELKVIEMPYAYQCCAFGVCENAYKISNQWNKGDNSSMDDLHKKDAGMFQAQDERDLEDFLLDFEm 
VQCSPSPGPFKPCEHLriDGWLIRIGVWTIAVLALTCNALVTST\^SPLYISPIKLLIGVIAAVN^ 
TFGSFARHGAWWENGVGGHVIGFLSIFASESSVFLLTLAALERGFSVKYSAKFETKAPFSSLKVIILLO^ 
WGSKYQASPLCLPLPFGEPSTMGYMVALILLNSLCFLNIMTIAYTKLYCNLDK^ 

VAPIiSFSSLINLTFISPEVIKFILLVVVPLPACLNPLLYILFNPHFKEDLVSLRKQTYVWTRSKHPSIJ^ 
DSTQALVTFTSSSITVDLPPSSVPSPAYPVTESCHLSSVAFVPCL 

>Final LGR7 (LGR7-Long variant) full length sequence (2467 nt) (SEQ ID NO:05). 

GAAAGGAGG/JU^GAAAAAAAGAGGAATGGAAAGAGACAGAGAAAGGAAATGGGAGTGGAAGGAGGGAGGACTGC^ 

GTAACTGCTAAGATTGCAGACAGAAATAGCACACAACCACTGTGAGCTGTATGCGATTCAGAAACCAAGACCAAATT 

TTGCTCACTTTCATTAATCAGTTGCTCAGATAGAAGGAAATGACATCTGGTTCTGTCTTCTTCTACATCTTAATTTT 

TGGAAAATATTTTTCTCATGGGGGTGGACAGGATGTCAAGTGCTCCCTTGGCTATTTCCCCTGTGGGAACATCACAA 

AGTGCTTGCCTCAGCTCCTGCACTGTAACGGTGTGGACGACTGCGGGAATCAGGCCGATGAGGACAACTGTGGAGAC 

AACAATGGATGGTCCATGOUITTTGACAAATATTTTGCCAGTTACTACAAAATGACTTCCCAATATCCTT^ 

AGAAACACCTGAATGTTTGGTCGGTTCTGTGCCAGTGCAATGTCTTTGCCAAGGTCTGGAGCTTGACTGTGATGAAA 

CCAATTTACGAGCTGTTCCATCGGTTTCTTCAAATGTGACTGCAATGTCACTTCAGTGGAACTTAATAAGAAAGCTO 

CCTCCTGATTGCTTCAAGAATTATCATGATCTTCAGAAGCTGTACCrGCAAAACAATAAGATTACATCCATCTCCAT 

CTATGCTTTCAGAGGACTGAATAGCCTTACTAAACTGTATCTCAGTCATAACAGAATAACCTTCCTGAAGCCGGGTG 

TTTTTGAAGATCTTCACAGACTAGAATGGCTGATAATTGAAGATAATCACCTCAGTCGAATTTCCCCACCAACATTT 

TATGGACTAAATTCTCTTATTCTCTTAGTCCTGATGAATAACGTCCTCACCCGTTTACCTGATAAACCTCTCTGTCA 

ACACATGCCAAGACTACATTGGCTGGACCTTGAAGGCAACCATATCCATAATTTAAGAAATTTGACTTTTATTTCCT 

GCAGTAATTTAACTGTTTTAGTGATGAGGAAAAAOUUU^TTAATCACTTAAATGAAAATACTTTTGCACCTCT^ 

AAACTGGATGAATTGGATTTAGGAAGTAATAAGATTGAAAATCTTCCACCGCTTATATTCAAG^^ 

GTCACAATTGAATCOTTCCTATAATCCAATCCAGAAAATTCAAGCAAACCAATTTGAT^ 

CTCTCAGCCTAGAAGGGATTGAAATTTCAAATATCCAACAAAGGATGTTTAGACCTCTTATGAAT^ 

TATTTTAAGAAATTCCAGTACTGTGGGTATGCACCACATGTTCGCAGCTGTAAACCAAACACTGATGGAATTTCATC 

TCTAGAGAATCTCTTGGCAAGCATTATTCAGAGAGTATTTGTCTGGGTTGTATCTGCAGTTACCTGCTTTGGAAACA 

TTTTTGTCATTTGCATGCGACCTTATATCAGGTCTGAGAACAAGCTGTATGCCATGTCAATCATTTCTCTCTGCTGT 

GCCGACTGCTTAATGGGAATATATTTATTCGTGATCGGAGGCTTTGACCTAAAGTTTCGTGGAGAATACAATAAGCA 

TGCGCAGCTGTGGATGGAGAGTACTCATTGTOVGCTTGTAGGATCTTTGGCCATTCTGTCCACAGAAGTATC^ 

TACTGTTAACATTTCTGACATTGGAAAAATAaVTCTGCATTGTCTATCCTTTTAGATGTGTGAGACCTGGAA;^ 

AGAACAATTACAGTTCTGATTCTCATTTGGATTACTGGTTTTATAGTGGCTTTCATTCCATTGAGC^ 

TTTCAAAAACTACTATGGCACCAATGGAGTATGCTTCCCTCTTCATTCAGAAGATACAGAAAGTATTGGAGCCC^ 

TTTATTCAGTGGCAATTTTTCTTGGTATTAATTTGGCCGCATTTATCATCATAGTTTTTTCC^^ 

TATAGTGTTCATCAAAGTGCCATAACAGCAACTGAAATACGGAATCAAGTTAAAAAAGAGATGATCCTTGCCAAACG 

TTTTTTCTTTATAGTATTTACTGATGCATTATGCTGGATACCCATTTTTGTAGTGAAATTTCTTTCACTGCTTCAGG 

TAGAAATACCAGG7ACCATAACCTCTTGGGTAGTGATTTTTATTCTGCCCATTAACAGTGCTTTGAACCCAATTCTC 

TATACTCTGACCyVCAAGACCATTTAAAGAAiVTGATTCATCGGTTTTGGTATAACTACAGACAAAGAAAATCTATGGA 

CAGCAAAGGTCAGAAAACATATGCTCCATCATTCATCTGGGTGGAAATGTGGCCACTGCAGGAGATGCCACCTGAGT 

TAATGAAGCCGGACCTTTTCACATACCCCTGTGAAATGTCACTGATTTCTCAATCAACGAGACTCAATTCCTATTCA 

TGA 



>Final LGR7 (LGR7-long variant, total 757 amino acids)(SEQ ID NO:06) 

MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADEDNCGDNNGWSMQFDKYFA 

SYYKMTSQYPFEAETPECLVGSVPVQCLCQGLELDCDETNLRAVPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQK 

LYLQNNKITSISIYAFRGLNSLTKLYLSHNRITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMN 

NVLTRLPDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNKINHLNENTFAPLQKLDELDLGSNKIE 

NLPPLIFKDLKELSQLNLSYNPIQKIQANQFDYLVKLKSLSLEGIEISNIQQRMFRPLMNLSHIYFKKFQYCGYAPH 

VRSCKPNTDGISSLENLLASIIQRVFVWVVSAVTCFGNIFVICMRPYIRSENKLYAMSIISLCCADCLMGIYLFVIG 

GFDLKFRGEYNKHAQLWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWITG 

FIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVFSYGSMFYSVHQSAITATEI 

RNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEIPGTITSWVVIFILPINSALNPILYTLTTRPFKEMIH 

RFWYNYRQRKSMDSKGQKTYAPSFIWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS* 

>Final LGR7 (LGR7-Short variant) full length sequence (3584 nt) (SEQ ID NO:07) 
CTGCTTTGTAACTGCTAAGATTGCAGACAGAAATAGCACACAACCACTGTGAGCTGTATG 

CCAAATTTTGCTCACTTTCATTAATCAGTTGCTCAGATAGAAGGAAATGACATCTGGTTCTGTCTTCTTCTACATCT 

TAATTTTTGGAAAATATTTTTCTCATGGGGGTGGACAGGATGTCAAGTGCTCCCTTGGCTATTTCCCCTGTGGGAAC 

ATCACAAAGTGCTTGCCTCAGCTCCTGCACTGTAACGGTGTGGACGACTGCGGGAATCAGGCCGATGAGGACAACTG 

TGTGGTGGTTTTGTGCCAGTGCATGTCTTTGCCAGGTCTGGAGCTTGACTGGATGAAACCATTTACGAGTGTTCCAT 

CGGTTTCTTCAAATGTGACTGCAATGTCACTTCAGTGGAACTTAATAAGAAAGCTTCCTCCTGATTGCTTCAAGAAT 

TATCATGATCITCAGAAGCTGGACCTGCAAAACAATAAGATTACATCCATCTCCATCTATG^^ 

TAGCCTTACTAAACTGTATCTCAGTO^TAACAGAATAACCTTCCTGAAGCCGGGTGTTTTTGAAG^^ 

TAGAATGGCTGATAATTGAAGATAATCACCTCAGTCGAATTTCCCCACCAACATTTTATGGACTAAATTCTCTTATT 

CTCTTAGTCCTGATGAATAACGTCCTCACCCGTTTACCTGATAAACCTCTCTGTC7UVCACATGCCAAGACTACATTG 

GCTGGACCTTGAAGGCAACCATATCCATAATTTAAGAAATTTGACTTTTATTTCCTGCAGTAATTTAACTGTTTTAG 

tgatgaggaaaaacaaaattaatcacttaaatgaaaatacttttgcacctctccagaaactggatgaattggat^^ 
ggaagtaataagattgaaaatcttccaccgcttatattcaaggacctgaaggagctgtcacaattgaatctttccta 
taatccaatccagaaaattcaagcaaaccaatttgattatcttgtcaaactcaagtctctcagcctag;^^ 
aaatttcaaatatccaao^ggatgtttagacctcttatgaatct 

TGTGGGTATGCACCACATGTTCGCAGCTGTAAACCAAACACTGATGGAATTTCATCTC^^ 
CATTATTCAGAGAGTATTTGTCTGGGTTQTATCTGCAGTTACCTGCTTTGGAAACATTTTTGTCATT^ 

cttatatcaggtctgagaacaagctgtatgccatgtcaatcatttctctctgctgtgccgactgcttaatgggaata 

TATTTATTCGTGATCGGAGGCTTTGACCTAAAGTTTCGTGGAGAATACAATAAGCATGCGCAGCTGTGGATGGAGAG 

tactcattgtcagcttgtaggatctttggccattctgtccacagaagtatcagttttactgttaacatttctgacat 

TGGAAAAATACATCTGCATTGTCTATCCTTTTAGATGTGTGAGACCTGGAAAATGCAGAACAATTACAGTTCTGATT 

ctcatttggattactggttttatagtggctttcattccattgagcaataaggaattt^ 
caatggagtatgcttccctcttcattcagaagatacagaaagtattggagccca6atttattcagtggcaatttttc 

TTGGTATTAATTTGGCCGCATTTATCATCATAGTTTTTTCCTATCGAAGCATGTTTTATAGTGTTCATCA^ 

ataacagcaactgaaatacggaatcaagttaaaaaagagatgatccttgccaaacgttttttctttatagtat 
tgatgcattatgctggatacccatttttgtagtgaaatttctttcactgcttcaggtagaaataccaggtaccataa 

CCTCTTGGGTAGTGATTTTTATTCTGCCCATTAACAGTGCTTTGAACCCAATTCTCTATACTCTGACCACAAGACCA 

tttaaagaaatgattctvtcggttttggtataactacagacaaagaaaatctatggacagcaaaggtcagaaaacata 

tgctccatcattcatctgggtggaaatgtggccactgcaggagatgccacctgagttaatgaagccggaccttttca 

catacccctgtgaaatgtcactgatttctcaatcaacgagactcaattcctattcatgactgactctgaaattcatt 

tcttcgcagagaatactgtgggggtgcttcat6agggatttactggtatgaaaatgaataccacaaaattm 

aataatagctaagataaatattttacaaggacatgaggaaaaataaaaatgactaatgctcttacaaagggaagtm 

ttatatcaataatgtatatatattagtagacattttgcataagaaattaagagaaatctacttcagtaacattca 

catttttctaacatgcatttattgagtacccactactatgtgcatagcattgcaatatagtcctggaagtagacagt 

GCAGAACCTTTCAATCTGTAGATAGTGTTTAATGACAAAAGACTATACAAAGTCCATCTGCAGTTCCTAGTTTAAAG 

TAGAGCTTTACCTGTCT^TGTGCATCAGCAAGAATCATAGGCACXriTAAATAAAGGTTTAAAGTTTT^^ 

GTGTATTTGCATCATAGAAAATGTCTGACTGTTTGCAAAATAATATTCTGTTTTAAGAATCCATCTTACCTCTCCT^ 

AAGTTTCCATACACTTGAGAGCCAACACAACATATTTATTACTAAAAAGATGCTTTGCTAGAA^ 

CTTCTTTTGGCACTTCCTGCCCAGTTTTCTCTTTGCTTTAAATGAACATCATCATATGGAATTGGAATAG^ 

TGAGTACGGCAGAGAAGTGGATCAGAAAAACTAGAATGAGQATAAACATTTACATTAGTGGAAACTCCT^ 

TCCTTGTATTGTCAGTTAACTGATTTTCAACAAGGATGCCAAGACAAAAAGGCTTrrCAACAAACCGTGCTGTTTTA 

AGAACAGACCTAAGTGGTTTAATTCACCCACTTTAGATGGGTGAATGTTATGGTGTGTGAAATATCTCAGTAAAGCA 

GTTAAAAGGAAAAAGAGCTGGAATGCACTGATTCAGGAACTTAATTTCAGGAAGGAAAGGTCTGTATGTACACATTT 

CACTTTAAGCAGAAAATCTTTCTTCAAGAAATGACTTTACTTTCTCTTTGCACTGCCAGCACGTGAGA 

TTTAACTAGTTGTTCTTCTCTAGTCTCTACGTTATTAGNATTTTTTGCTTTCATAATGTGAAACCTTTAAGCAG 

AAGAAAATGTTTTCAGATAGTTTOU^TACNCCAAAAATGTTTGCAACACAAAAATACTGG^ 

CCTTATTGAATATATAGTTGTATAGNTTTGTTCTGAAAACCC 

>Final LGR7-S ORF (722 amino acids) (SEQ ID NO: 08) 

mtsgsvffyilifgkyfshgggqdvkcslgyfpcgnitkclpqllhcngvddcgnqadedncvvvlcqcmslpglel 

dwmkpftsvpsvssnvtamslqwnlirklppdcfknyhdlqkldlqnnkitsisiyafrglnsltklylshnritfl 

kpgvfedlhrlewliiednhlsrispptfyglnslillvlmnnvltrlpdkplcqhmprlhwldlegnhihnlrnlt 

fiscsnltvlvmrknkinhlnentfaplqkldeldlgsnkienlpplifkdlkelsqlnlsynpiqkiqanqfdylv 

klkslslegieisniqqrmfrplmnlshiyfkkfqycgyaphvrsckpntdgisslenllasiiqrvfvwvvsavtc 

fgnifvicmrpyirsenklyamsiislccadclmgiylfviggfdlkfrgeynkhaqlwmesthcqlvgslailste 

vsvllltfltlekyicivypfrcvrpgkcrtitvliliwitgfivafiplsnkeffknyygtngvcfplhsedtesi 

gaqiysvaiflginlaafiiivfsygsmfysvhqsaitateirnqvkkemilakrfffivftdalcwipifvvkfls 

llqveipgtitswvvifilpinsalnpilytlttrpfkemihrfwynyrqrksmdskgqktyapsfiwvemwplqem 

ppelmkpdlftypcemslisqstrlnsys* 

>Alignment of LGR7-L with LGR7-S 

Query=LGR7-L 

Sbjct=LGR7-S 

Query: 1   MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 60 
           MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 
Sbjct: 1   MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 60 

Query: 61  NCGDNNGWSMQFDKYFASYYKMTSQYPFEAETPECLVGSVPVQCLCQ---GLELDCDETN 117 
           NC                                     V V C C    GLELD  + 
Sbjct: 61  NC-------------------------------------VVVLCQCMSLPGLELDWMKP- 82 

Query: 118 LRAVPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKLYLQNNKITSISIYAFRGLNSLT 177 
             +VPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKL LQNNKITSISIYAFRGLNSLT 
Sbjct: 83  FTSVPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKLDLQNNKITSISIYAFRGLNSLT 142 

Query: 178 KLYLSHNRITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMNNVLTRL 237 
           KLYLSHNRITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMNNVLTRL 
Sbjct: 143 KLYLSHNRITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMNNVLTRL 202 

Query: 238 PDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNKINHLNENTFAPLQKL 297 
           PDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNKINHLNENTFAPLQKL 
Sbjct: 203 PDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNKINHLNENTFAPLQKL 262 

Query: 298 DELDLGSNKIENLPPLIFKDLKELSQLNLSYNPIQKIQANQFDYLVKLKSLSLEGIEISN 357 
           DELDLGSNKIENLPPLIFKDLKELSQLNLSYNPIQKIQANQFDYLVKLKSLSLEGIEISN 
Sbjct: 263 DELDLGSNKIENLPPLIFKDLKELSQLNLSYNPIQKIQANQFDYLVKLKSLSLEGIEISN 322 

Query: 358 IQQRMFRPLMNLSHIYFKKFQYCGYAPHVRSCKPNTDGISSLENLLASIIQRVFVWVVSA 417 
           IQQRMFRPLMNLSHIYFKKFQYCGYAPHVRSCKPNTDGISSLENLLASIIQRVFVWVVSA 
Sbjct: 323 IQQRMFRPLMNLSHIYFKKFQYCGYAPHVRSCKPNTDGISSLENLLASIIQRVFVWVVSA 382 

Query: 418 VTCFGNIFVICMRPYIRSENKLYAMSIISLCCADCLMGIYLFVIGGFDLKFRGEYNKHAQ 477 
           VTCFGNIFVICMRPYIRSENKLYAMSIISLCCADCLMGIYLFVIGGFDLKFRGEYNKHAQ 
Sbjct: 383 VTCFGNIFVICMRPYIRSENKLYAMSIISLCCADCLMGIYLFVIGGFDLKFRGEYNKHAQ 442 

Query: 478 LWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 537 
           LWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 
Sbjct: 443 LWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 502 

Query: 538 TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVF 597 
           TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVF 
Sbjct: 503 TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVF 562 

Query: 598 SYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 657 
           SYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 
Sbjct: 563 SYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 622 

Query: 658 PGTITSWVVIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 717 
           PGTITSWVVIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 
Sbjct: 623 PGTITSWVVIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 682 

Query: 718 IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS 757 
           IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS 
Sbjct: 683 IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS 722 
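
The degree of identity between two aligned rows such as the Query and Sbjct rows above can be computed with a short sketch. It is illustrative only and assumes one common convention, namely that identity is counted over columns in which neither row has a gap character; other conventions divide by the full alignment length. The function name is hypothetical.

```python
def percent_identity(query_row: str, subject_row: str) -> float:
    """Percent identity over ungapped columns of two equal-length aligned rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    aligned = 0
    matches = 0
    for q, s in zip(query_row.upper(), subject_row.upper()):
        if q == "-" or s == "-":
            continue  # skip columns where either sequence has a gap
        aligned += 1
        if q == s:
            matches += 1
    return 100.0 * matches / aligned if aligned else 0.0


# Final alignment block (Query 718-757 against Sbjct 683-722): identical rows.
last_block = percent_identity(
    "IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS",
    "IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS",
)
```

Applied block by block, the same function shows that the two variants differ only in the region around Query residues 61 to 156, where the rows contain gaps and a Tyr/Asp mismatch.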

FIG. 6 

Signal peptide 

LGR4 MPGPLGLLCFLALGLLGSAGPSGA 
LGR5 MDTSRLGVLLSLPVLLQLATG 
LHR MKQRFSALQLLKLLLLLQPPLPRA 
FSHR MALLLVSLLAFLSLGSG 
TSHR MRPADLLQLVLLLDLPRDLGG 

N- flank cysteine-rich sequence 

LGR4 APPL AA-P S DGDR RVD SGKGLTAVPEGLSAFTQA 

LGR5 GSSPRSGVLLRG P-TH H EPDGRMLLRVD SDLGLSELPSNLSVFTSY 
LHR LREAL P-EP N VPDG--ALR-- PGPTAGLTR 

FSHR HHRI H SNRVFL QESKVTEIPSDLPRNAIE 

TSHR MG SSPP E HQEED--FRVT KDIQRIPSLPPSTQT 

Leucine -rich repeats 

LGR4 DISMNNITQLPED KSFPFLEELQLAGN SL HPKALSG KE KVLTLQ Q 
LGR5 DLSMNNISQLLPNPLPSIiHFLEELRLAGNA-- TY PKGA TG YS KVLMLQ Q 
LHR SLAYLPVKVIPSQ RGLNEVIKIEISQI S- ER EANA DN LN SEILIQ TK - 

FSHR RFVLTKLRVIQKG SGFGDLEKIEISQN V- EV EADV SN PK HEIRIEKAN - 
TSHR KLIETHLRTIPSH SNLPNISRIYVSI- VT QQLESHS YN SKVTHIEIR TR - 

LGR4 RTV- SE IHG SA QS RLDA H- TSV EDS--FEGLVQLRH WLD S-L- EV VR 
LGR5 RHV- TE LQN RS QS RLDA H- SYV P-SC-FSGLHSLRH WLD A-L- E VQ 
LHR RYIE -G FIN PG KY SIC- TG RKF DVTKVFSSESNFI- EIC LHI- T GN 
FSHR LYIN -E FQN PN QY LIS- TG KHL DVHK-IHSLQKVL- DIQ INIH - ERN 
TSHR TYID -D LKE PL KF GIF- TGLKMF DLTK-VYSTDIFFI EIT PYM- S VN 

LGR4 PLSN P-TLQA T AL NISSIPDF T LSS W H HN K-IKSLSQHC D LDN-LE 
LGR5 A RS S-ALQAMT AL KIHHIPDY G LSSWW H HN R-IHSLGKKC D LHS-LE 
LHR A QGMNNESVT K YG GFEEVQSH - GTT TS E KE VHLEKMHNGA R A-TGPK 
FSHR S VG SFESVI W NK GIQEIHNC - GTQ DE N SD NNLEELPNDV H A-SGPV 
TSHR A QG CNETLT K YN GFTSVQQY - GTK DAVY NK KYLTVIDKDA G VYSGPS 

LGR4 T LNYNYLDEF Q-AIKA PS KELGFHSNSISVI D-GA GGNPL RTIH - DNPLS 
LGR5 T LNYNNLDEF T-AIRT SN KELGFHSNNIRSI E-KA VGNPS ITIHF- DNPIQ 

LHR T ISSTKLQAL SYGLESIQR I-ATS-SYSLKKL SRET V-N-- LEAT T 

FSHR I ISRTRIHSL SYGLEN KK R-ARSTYN-LKKL TLEKLVA--- MEAS T 

TSHR L VSQTSVTAL SKGLEH KE I-ARNTWT-LKKL LSLS LH--- TRAD S 



LGR4 FVGNSAFHNLSDLHCLVIRGASLVQWFPNLTGTVHLESLTLTGTKISSIPDDLCQNQKML 
LGR5 FVGRSAFQHLPELRTLTLNGASQITEFPDLTGTANLESLTLTGAQISSLPQTVCNQLPNL 
LHR - - - - 

LGR4 RTLDLSYNNIRDLPSFNGCRALEEISLQRNQISLIKENTFQGLTSLRILDLSRNLIREIH 
LGR5 QVLDLSYNLLEDLPSFSVCQKLQKIDLRHNEIYEIKVDTFQQLLSLRSLNLAWNKIAIIH 

LHR 

FSHR 

TSHR 



LGR4 SGAFAKLGTITNLDVSFNELTSFPTEGLNGLNQLK 
LGR5 PNAFSTLPSLIKLDLSSNLLSSFPITGLHGLTHLK 

LHR 

FSHR 

TSHR 



C' flank cysteine -rich sequence 



LGR4 LVGNFKLKDALAARDFANLRSLSV YAYQ WGCDSLCKLNTEDNSPQEHSVTKEKGA 

LGR5 LTGNHALQSLISSENFPELKVIEM YAYQ GVCENAYKISNQWNKGDNSSMDDLHKK 

LHR SH RNLPTKEQNFSHSISENFSKQCESTVR 

FSHR SH ANWRRQISELHPICNKSILRQEVDYMT 

TSHR SH KNQKKIRGILESLMCNESSMQSLRQRK 



LGR4 TDAANVTSTAENE HS 

LGR5 DAGMFQAQDERDL DF-- 

LHR KVSNKTLYSSMLA SE 

FSHR QTRGQRSSLAEDN SS - 

TSHR SVNAXiNSPLHQEY ENLGDSIVGYKEKSKFQDTHNNAHYYVFFEEQEDEIIGFGQELKNP 



LGR4 QIIIH T STGA K YLLGSWMI 
LGR5 LLDFEEDLKALHS VQ S SPGP K HLLDGWLI 
LHR LSGWDYEYGFCLPKTPR- A EPDA N DIMGYDFL 
FSHR YSRGFDMTYTEFDYDLCNEWDVT S KPDA N DIMGYNIL 
TSHR QEETLQAFDSHYDYTICGDSEDMV T KSDE N DIMGYKFL 



Transmembrane 

TM 1 TM 2 

LGR4 LTV F FLV LLF LL ILTVFA CSS PASKLFIGLISVSNLLM IYTGILTFL AVSW 
LGR5 IGV T AV LTC AL TSTVFR PLYISPIKL IGVIAAVNMLT VSSAVL G AF F 
LHR VLI L NI IMG MT LFVLLT RYK TVPRF MCNLSFADFCM LYLLLI S SQ K 
FSHR VLI F SI ITG II LVILTT QYK TVPRF MCNLAFADLCI IYLLLI S IH K 
TSHR IW FVSL LLG VF LLILLT HYK NVPRF MCNLAFADFCM MYLLLI S LY H 



TM 3 

LGR4 GRFAEFG W E S KV SLA S SA FL LAAV SVFAKDLMKHGKSSH QF 
LGR5 GSFARHGAW EN V HVI LSI S FL LAA GFSVKYSAKFET APFSSL 
LHR GQYYNHA D Q S ST FT L YT VIT WHTITYAIHLDQ LR HA 
FSHR SQYHNYA D Q A DA FT L YT AIT WHTITHAMQLDC VQ HA 
TSHR SEYYNHA D Q P NT FT L YT VIT WYAITFAMRLDR IR HA 

TM 4 TM 5 

LGR4 QVAALLALLGAAVAGCF FHGGQ SASPL FPTGETPSLGFTVTLVL SL LLMA 
LGR5 KVIILLCALLALTM AV L G K GASPL LPFGEPSTMG MVALIL SLC LMMT 
LHR ILIMLGGWLFSSLI ML V V N MKVSI F MDVETTLSQV ILTILI W Flic 
FSHR ASVMVMGWIFAFAA LF IF I S MKVSI MDIDSPLSQL VMSLLV VL WIC 
TSHR CAIMVGGWVCCFLL LL V I S AKVSI MDTETPLALA IVFVLT IV Vive 



TM 6 

LGR4 II T L CNL-EKEDLSENSQSSVI HV W NCIFFC VA FSFAPLITAIS SPEI 
LGR5 IA T L CNL-DKGDLENIW CSMV HI L L NCILNC VA LSF SLINLTF SPEV 
LHR AC I I FAVRNPELMATNK TKIA KM I DFTCMA IS FAI AAFKVPL TVTN 
FSHR GC IHI LTVRNPNXVSSSS TRIA RM M DFLCMA IS FAX ASLKVPL TVSK 
TSHR CCHV I ITVRNPQYNPGDK TKIA RM V DFICMA IS YAL AILNKPL TVSN 



TM 7 

LGR4 M SVTLI F LPA L V VF N 
LGR5 I FI LVW LPA L L IL N 
LHR S VL VL Y INS A F AI T 
FSHR A IL VL H INS A F AI T 
TSHR S IL VL Y LNS A F AI T 



C-terminal tail 

LGR4 PK KE WKL KRRVTRKHGSVSVSISSQGGCGEQDFYYDCGMYSHLQGNLTVCDCCESFL 

LGR5 PH KE LVS RKQTYVWTRSKHPSLMSINSDDVEKQSCDSTQALVTFTSSSITYDLPPSS 

LHR KT QR FFL LSKFGCCKRRAELYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQG 

FSHR KN RR FFI LSKCGCYEMQAQIYRTETSSTVHNTHPRNGHCSSAPRVTNGSTYILVPLS 

TSHR KA QR VFI LSKFGICKRQAQAYRGQRVPPKNSTDIQVQKVTHDMRQGLHNMEDVYELI 

LGR4 LTKPVSCKHLIKSHSCPVLTAASCQRPEAYWSDCGTQSAHSDYADEEDSFVSDSSDQVQA 

LGR5 VPSPAYPVTESCHLSSVAFVPCL 

LHR TALLDKTRYTEC 

FSHR HLAQN 

TSHR ENSHLTPKKQGQISEEYMQTVL 

LGR4 CGRACFYQSRGFPLVRYAYNLQRVRD 

SEQUENCE LISTING 

<110> Hsueh, Aaron 
Hsu, Yu Sheau 
Liang, Shan-Guang 
van der Spek, Petrus Johannes 

<120> Novel Mammalian G-Protein Coupled 

Receptors Having Extracellular Leucine Rich Repeat Regions 



<130> SUN-84PCT 



<160> 8 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2856 

<212> DNA 
<213> human 



<400> 1 



atgccgggcc cgctagggct gctctgcttc ctcgccctgg ggctgctcgg ctcggccggg 60 

cccagcggcg cggcgccgcc tctctgcgcg gcgccctgca gctgcgacgg cgaccgtcgg 120 

gtggactgct ccggaaaggg gttgacggcc gtaccggagg gtctcagcgc cttcacccaa 180 

gcactggata tcagtatgaa caatatcacc cagttaccag aagatgcatt taagagtttc 240 

ccatttctag aggagctaca actggctggt aacgaccttt ctcttatcca tccaaaagcc 300 

ttgtctgggc tgaaagaact caaagtccta acactccaga ataatcagtt gagaacagtg 360 

cccagtgaag ccattcacgg actgagtgct ttgcagtctt tacgcttaga tgccaaccat 420 

attacctcag tcccggagga cagttttgaa gggcttgtcc agttacgcca tctgtggctg 480 

gatgacaaca gcttgacgga agtgcccgtg cgtcccctca gcaacctgcc aaccctgcag 540 

gcgctgacct tggctctcaa caacatctca agcatccctg acttcgcttt caccaacctt 600 

tcaagcttgg tggttctgca tctgcataac aataaaatta aaagcctcag tcaacactgt 660 

tttgatggac tagataacct ggaaaccttg gacttgaatt acaattactt ggatgagttt 720 

cctcaggcta ttaaagccct tcccagcctt aaagagctgg gatttcacag taattctatt 780 

tctgttattc ctgatggagc atttggtggt aatccactgc taagaactat tcatttgtat 840 

gataatcctc tgtcttttgt ggggaactca gcatttcaca acctgtctga tctgcattgc 900 

ttagtcattc gtggtgcaag cctggtgcag tggttcccca atctgaccgg aactgtccat 960 

ttggagagtc taaccttgac agggacaaaa ataagcagca tacctgatga tctgtgccaa 1020 

aaccaaaaga tgctgaggac tctggactta tcttataaca atataagaga ccttccaagt 1080 

tttaatggtt gtcgtgcatt ggaagaaatt tcattgcagc gtaatcaaat ctccctaata 1140 

aaggaaaata cttttcaagg cctaacatct ctaaggattc tagatctgag tagaaacctg 1200 

atccgtgaaa ttcacagtgg agcttttgcg aagcttggga caattactaa cctggatgta 1260 

agtttcaatg aattaacttc atttcctacg gaaggcctaa atgggctcaa tcaactaaag 1320 

cttgtgggta acttcaagct gaaagacgcc ttggcagcca gagactttgc taatctcagg 1380 

tctctatcag taccatatgc ttatcagtgt tgtgcatttt gggggtgtga ctctttatgc 1440 

aaattaaaca cagaagataa cagcccccaa gaacacagtg tgacaaaaga gaaaggtgct 1500 

acagatgcag caaatgtcac cagcactgct gagaacgaag aacatagcca aataattatc 1560 

cactgtacac cttcaacagg tgctttcaag ccctgtgaat atttactggg aagctggatg 1620 

attcgcctta cagtgtggtt cattttcctg gtcgccttgc ttttcaacct gcttgtcatt 1680 

ttaacagtgt ttgcgtcttg ttcatcactg cctgcctcca aactcttcat aggcttgatt 1740 

tctgtgtcta acttactcat gggcatctat actggcatcc ttacttttct tgatgctgtg 1800 

tcctggggcc gatttgccga atttggcatt tggtgggaaa ctggcagcgg ctgcaaggta 1860 

gccgggtctc tggcagtctt ctcctcagag agcgctgtat tcctattaac actggcagct 1920 

gtggaaagaa gcgtatttgc aaaggatttg atgaaacacg ggaagagcag tcacctcaga 1980 

cagttccagg tggccgccct cttagctttg ctgggtgccg cagtggcagg ctgcttcccc 2040 

cttttccacg gagggcaata ttctgcatcg cccttgtgtt tgccgtttcc tacaggagaa 2100 

accccatcgt taggattcac tgtgacctta gtgctattaa actcactggc atttttacta 2160 

atggccatta tctacactaa actatactgc aacttagaga aggaggacct gtcggaaaac 2220 

tcccagtcta gcgtgattaa gcacgttgcc tggctcatct tcacaaactg catcttcttc 2280 

tgccctgttg catttttctc atttgcacca ttgatcacgg caatctccat cagccccgag 2340 

ataatgaagt ctgttacact gatattcttc ccgttgcctg cttgcctgaa tccggtcctg 2400 

tatgttttct tcaacccaaa gtttaaagaa gactggaagc tactgaagcg gcgtgttacc 2460 

aggaaacacg gatctgtttc agtttccatc agcagccaag gcggttgtgg ggaacaggat 2520 

ttctactatg actgtggcat gtattcccac ttgcagggta acctgactgt ctgtgactgc 2580 

tgtgagtcat ttcttttgac aaaaccagta tcatgcaaac acttaataaa atcgcacagt 2640 

tgtcctgtat tgacagcggc ctcttgccag aggccagagg cctactggtc tgattgtggt 2700 

acacagtcag cccattctga ctatgcagat gaagaagatt cctttgtctc agacagctct 2760 

gaccaggtgc aggcctgtgg acgagcctgc ttctaccaga gtcgtggatt ccctctggtg 2820 

cgctatgctt acaatctaca gagagtcaga gactga 2856 

<210> 2 

<211> 951 

<212> PRT 

<213> human 

<400> 2 

Met Pro Gly Pro Leu Gly Leu Leu Cys Phe Leu Ala Leu Gly Leu Leu 

1 5 10 15 

Gly Ser Ala Gly Pro Ser Gly Ala Ala Pro Pro Leu Cys Ala Ala Pro 

20 25 30 

Cys Ser Cys Asp Gly Asp Arg Arg Val Asp Cys Ser Gly Lys Gly Leu 

35 40 45 

Thr Ala Val Pro Glu Gly Leu Ser Ala Phe Thr Gln Ala Leu Asp Ile 

50 55 60 

Ser Met Asn Asn Ile Thr Gln Leu Pro Glu Asp Ala Phe Lys Ser Phe 
65 70 75 80 

Pro Phe Leu Glu Glu Leu Gln Leu Ala Gly Asn Asp Leu Ser Leu Ile 

85 90 95 

His Pro Lys Ala Leu Ser Gly Leu Lys Glu Leu Lys Val Leu Thr Leu 

100 105 110 

Gln Asn Asn Gln Leu Arg Thr Val Pro Ser Glu Ala Ile His Gly Leu 

115 120 125 

Ser Ala Leu Gln Ser Leu Arg Leu Asp Ala Asn His Ile Thr Ser Val 

130 135 140 

Pro Glu Asp Ser Phe Glu Gly Leu Val Gln Leu Arg His Leu Trp Leu 
145 150 155 160 

Asp Asp Asn Ser Leu Thr Glu Val Pro Val Arg Pro Leu Ser Asn Leu 

165 170 175 

Pro Thr Leu Gln Ala Leu Thr Leu Ala Leu Asn Asn Ile Ser Ser Ile 

180 185 190 

Pro Asp Phe Ala Phe Thr Asn Leu Ser Ser Leu Val Val Leu His Leu 

195 200 205 

His Asn Asn Lys Ile Lys Ser Leu Ser Gln His Cys Phe Asp Gly Leu 

210 215 220 

Asp Asn Leu Glu Thr Leu Asp Leu Asn Tyr Asn Tyr Leu Asp Glu Phe 
225 230 235 240 

Pro Gln Ala Ile Lys Ala Leu Pro Ser Leu Lys Glu Leu Gly Phe His 

245 250 255 

Ser Asn Ser Ile Ser Val Ile Pro Asp Gly Ala Phe Gly Gly Asn Pro 

260 265 270 

Leu Leu Arg Thr Ile His Leu Tyr Asp Asn Pro Leu Ser Phe Val Gly 

275 280 285 

Asn Ser Ala Phe His Asn Leu Ser Asp Leu His Cys Leu Val Ile Arg 
290 295 300 

Gly Ala Ser Leu 
305 

Leu Glu Ser Leu 

Asp Leu Cys Gln 
340 

Asn Asn Ile Arg 
355 

Glu Ile Ser Leu 

370 

Phe Gln Gly Leu 
385 

Ile Arg Glu Ile 

Asn Leu Asp Val 
420 

Leu Asn Gly Leu 
435 

Asp Ala Leu Ala 
450 

Pro Tyr Ala Tyr 
465 

Lys Leu Asn Thr 

Glu Lys Gly Ala 

500 

Glu Glu His Ser 
515 

Phe Lys Pro Cys 
530 

Val Trp Phe Ile 
545 

Leu Thr Val Phe 

Ile Gly Leu Ile 
580 

Ile Leu Thr Phe 
595 

Gly Ile Trp Trp 
610 

Ala Val Phe Ser 
625 

Val Glu Arg Ser 

Ser His Leu Arg 
660 

Ala Ala Val Ala 
675 

Ala Ser Pro Leu 
690 

Gly Phe Thr Val 
705 

Met Ala Ile Ile 

Leu Ser Glu Asn 
740 

Ile Phe Thr Asn 
755 



Val 


Gln


Trp 


Phe 




310 






Thr 


Leu 


Thr 


Gly 


325 








Asn 


Gln


Lys 


Met 




T .011 


Pro 
*r JL \j 










360 


Gln


Arg 


Asn 


Gln






375 




Thr 


Ser 


Leu 


Arg 




390 






His 


Ser 


Gly 


Ala 


405 








Ser 


Phe 


Asn 


Glu 


Asn 


Gln


Leu 


Lys 








440 


Ala 


Arg 


Asp 


Phe 






455 




Gln


Cys 


Cys 


Ala 




470 






Glu 


Asp 


Asn 


Ser 


485 








Thr 


Asp 


Ala 


Ala 




T 1 p 




T 1 P 








520 


Glu 


Tyr 


Leu 


Leu 






535 




Phe 


Leu 


Val 


Ala 




550 






Ala 


Ser 


Cys 


Ser 


565 








Ser 


Val 


Ser 


Asn 


Leu 


Asp 


AT A 


Va 1 
vox 








600 


Glu 


Thr 


Gly 


Ser 






615 




Ser 


Glu 


Ser 


Ala 




630 






Val 


Phe 


Ala 


Lys 


645 








Gln


Phe 


Gln


Val 


\3±y 


Vii»y a 


It lie 


ex. \J 








680 


Cys 


Leu 


Pro 


Phe 






695 




Thr 


Leu 


Val 


Leu 




710 






Tyr 


Thr 


Lys 


Leu 


725 








Ser 


Gln


Ser 


Ser 


Cys 


Ile


Phe 


Phe 



760 



Pro 


Asn 


Leu 


Thr 






315 




Thr 


Lys 


Ile


Ser 




330 






Leu 


Arg 


Thr 


Leu 


345 








Phe 


Asn 


Gly 


Cys 


Tl A 

XX1= 


OCX 


T.oti 


X xc 








380 


Ile


Leu 


Asp 


Leu 






395 




Phe 


Ala 


Lys 


Leu 




410 






Leu 


Thr 


Ser 


Phe 


425 








Leu 


Val 


Gly 


Asn 


Ala 


Asn 


Leu 


Arg 








460


Phe 


Trp 


Gly 


Cys 






475 




Pro 


Gln


Glu 


His 




490






Asn 


Val 


Thr 


Ser 


505 








His 


Cys 


Thr 


Pro 


Gly 


Ser 


1 xp 


Met 








540 


Leu 


Leu 


Phe 


Asn 






555 




Ser 


Leu 


Pro 


Ala 




570 






Leu 


Leu 


Met 


Gly 


585 








Ser 


Trp 


Gly 


Arg 




Cys 


Lys 


Vox 








620 


Val 


Phe 


Leu 


Leu 






635 




Asp 


Leu 


Met 


Lys 




650 






Ala 


Ala 


Leu 


Leu 


665 








Leu 


Phe 


His 


Gly 


Pro 


Thr 


Gly 


Glu 








700 


Leu 


Asn 


Ser 


Leu 






715 




Tyr 


Cys 


Asn 


Leu 




730 






Val 


Ile


Lys 


His 


745 








Cys 


Pro 


Val 


Ala 



Gly 


Thr 

1 IIX 


Ma 1 
Vox 


nx s 








320 


Ser 


Ile


Pro 


Asp 






335 




Asp 


Leu 


Ser 


Tyr 




350 






Arg 


Ala 


Leu 


Glu 


365 








Lys 


Glu 


Asn 


Thr 


C Q T- 
O CX 


Arg 


Asn 


Leu 








400 


Gly 


Thr 


Ile


Thr 






415 




Pro 


Thr 


Glu 


Gly 




430 






Phe 


Lys 


Leu 


Lys 


445 








Ser 


Leu 


Ser 


Val 


Asp 


Ocx 


Leu 


Cys 








480 


Ser 


Val 


Thr 


Lys 






495 




Thr 


Ala 


Glu 


Asn 




510 






Ser 


Thr 


Gly 


Ala 


525 








Ile


Arg 


Leu 


Thr 


Leu 


Leu 


vox 


Tip 

xxe 








560 


Ser 


Lys 


Leu 


Phe 






575 




Ile


Tyr 


Thr 


Gly 




590 






Phe 


Ala 


Glu 


Phe 


605 








Ala 


Gly 


Ser 


Leu 


Thr* 
1 iix 


Leu 


rVXo 


rixo 








640 


His 


Gly 


Lys 


Ser 






655 




Ala 


Leu 


Leu 


Gly 




670 






Gly 


Gln


Tyr 


Ser 


685 








Thr 


Pro 


Ser 


Leu 


Ala 


Phe 


Leu 


Leu 








720 


Glu 


Lys 


Glu 


Asp 






735 




Val 


Ala 


Trp 


Leu 




750 






Phe 


Phe 


Ser 


Phe 



765 



WO 99/48921

PCT/US99/06573



Ala Pro Leu Ile Thr Ala Ile Ser Ile Ser Pro Glu Ile Met Lys Ser
770 775 780

Val Thr Leu Ile Phe Phe Pro Leu Pro Ala Cys Leu Asn Pro Val Leu
785 790 795 800

Tyr Val Phe Phe Asn Pro Lys Phe Lys Glu Asp Trp Lys Leu Leu Lys
805 810 815

Arg Arg Val Thr Arg Lys His Gly Ser Val Ser Val Ser Ile Ser Ser
820 825 830

Gln Gly Gly Cys Gly Glu Gln Asp Phe Tyr Tyr Asp Cys Gly Met Tyr
835 840 845

Ser His Leu Gln Gly Asn Leu Thr Val Cys Asp Cys Cys Glu Ser Phe
850 855 860

Leu Leu Thr Lys Pro Val Ser Cys Lys His Leu Ile Lys Ser His Ser
865 870 875 880

Cys Pro Val Leu Thr Ala Ala Ser Cys Gln Arg Pro Glu Ala Tyr Trp
885 890 895

Ser Asp Cys Gly Thr Gln Ser Ala His Ser Asp Tyr Ala Asp Glu Glu
900 905 910

Asp Ser Phe Val Ser Asp Ser Ser Asp Gln Val Gln Ala Cys Gly Arg
915 920 925

Ala Cys Phe Tyr Gln Ser Arg Gly Phe Pro Leu Val Arg Tyr Ala Tyr
930 935 940

Asn Leu Gln Arg Val Arg Asp
945 950



<210> 3 
<211> 2082 
<212> DNA 
<213> human 

<400> 3 

ctacatctcc ataacaatag aatccactcc ctgggaaaga aatgctttga tgggctccac 60 

agcctagaga ctttagattt aaattacaat aaccttgatg aattccccac tgcaattagg 120 

acactctcca acttaaagga actaggattt catagcaaca atatcaggtc gatacctgag 180 

aaagcatttg taggcaaccc ttctcttatt acaatacatt tctatgacaa tcccatccaa 240 

tttgttggga gatctgcttt tcaacattta cctgaactaa gaacactgac tctgaatggt 300 

gcctcacaaa taactgaatt tcctgattta actggaactg caaacctgga gagtctgact 360 

ttaactggag cacagatctc atctcttcct caaaccgtct gcaatcagtt acctaatctc 420 

caagtgctag atctgtctta caacctatta gaagatttac ccagtttttc agtctgccaa 480

aagcttcaga aaattgacct aagacataat gaaatctacg aaattaaagt tgacactttc 540 

cagcagttgc ttagcctccg atcgctgaat ttggcttgga acaaaattgc tattattcac 600 

cccaatgcat tttccacttt gccatcccta ataaagctgg acctatcgtc caacctcctg 660 

tcgtcttttc ctataactgg gttacatggt ttaactcact taaaattaac aggaaatcat 720 

gccttacaga gctggatatc atctgaaaac tttccagaac tcaaggtnat agaaatgcct 780 

tatgcttacc agtgctgtgc atttggagtg tgtgagaatg cctataagat ttctaatcaa 840 

tggaataaag gtgacaacag cagtatggac gaccttcata agaaagatgc tggaatgttt 900 

caggctcaag atgaacgtga ccttgaagat ttcctgcttg actttgagga agacctgaaa 960 

gcccttcatt cagtgcagtg ttcaccttcc ccaggcccct tcaaaccctg tgaacacctg 1020 

cttgatggct ggctgatcag aattggagtg tggaccatag cagttctggc acttacttgt 1080 

aatgctttgg tgacttcaac agttttcaga tcccctctgt acatttcccc cattaaactg 1140 

ttaattgggg tcatcgcagc agtgaacatg ctcacgggag tctccagtgc cgtgctggct 1200 

ggtgtggatg cgttcacttt tggcagcttt gcacgacatg gtgcctggtg ggagaatggg 1260 

gttggttgcc atgtcattgg ttttttgtcc atttttgctt cagaatcatc tgttttcctg 1320 

cttactctgg cagccctgga gcgtgggttc tctgtgaaat attctgcaaa atttgaaacg 1380 

aaagctccat tttctagcct gaaagtaatc attttgctct gtgccctgct ggccttgacc 1440 

atggccgcag ttcccctgct gggtggcagc aagtatggcg cctcccctct ctgcctgcct 1500 

ttgccttttg gggagcccag caccatgggc tacatggtcg ctctcatctt gctcaattcc 1560 

ctttgcttcc tcatgatgac cattgcctac accaagctct actgcaattt ggacaaggga 1620 

gacctggaga atatttggga ctgctctatg gtaaaacaca ttgccctgtt gctcttcacc 1680 
aactgcatcc taaactgccc tgtggctttc ttgtccttct cctctttaat aaaccttaca 1740

tttatcagtc ctgaagtaat taagtttatc cttctggtgg tagtcccact tcctgcatgt 1800 

ctcaatcccc ttctctacat cttgttcaat cctcacttta aggaggatct ggtgagcctg 1860 

agaaagcaaa cctacgtctg gacaagatca aaacacccaa gcttgatgtc aattaactct 1920 

gatgatgtcg aaaaacagtc ctgtgactca actcaagcct tggtaacctt taccagctcc 1980 

agcatcactt atgacctgcc tcccagttcc gtgccatcac cagcttatcc agtgactgag 2040 

agctgccatc tttcctctgt ggcatttgtc ccatgtctct aa 2082 

<210> 4 
<211> 693 
<212> PRT 
<213> human 



<400> 4

Leu His Leu His Asn Asn Arg Ile His Ser Leu Gly Lys Lys Cys Phe
1 5 10 15

Asp Gly Leu His Ser Leu Glu Thr Leu Asp Leu Asn Tyr Asn Asn Leu
20 25 30

Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu Ser Asn Leu Lys Glu Leu
35 40 45

Gly Phe His Ser Asn Asn Ile Arg Ser Ile Pro Glu Lys Ala Phe Val
50 55 60

Gly Asn Pro Ser Leu Ile Thr Ile His Phe Tyr Asp Asn Pro Ile Gln
65 70 75 80

Phe Val Gly Arg Ser Ala Phe Gln His Leu Pro Glu Leu Arg Thr Leu
85 90 95

Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu Phe Pro Asp Leu Thr Gly
100 105 110

Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr Gly Ala Gln Ile Ser Ser
115 120 125

Leu Pro Gln Thr Val Cys Asn Gln Leu Pro Asn Leu Gln Val Leu Asp
130 135 140

Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro Ser Phe Ser Val Cys Gln
145 150 155 160

Lys Leu Gln Lys Ile Asp Leu Arg His Asn Glu Ile Tyr Glu Ile Lys
165 170 175

Val Asp Thr Phe Gln Gln Leu Leu Ser Leu Arg Ser Leu Asn Leu Ala
180 185 190

Trp Asn Lys Ile Ala Ile Ile His Pro Asn Ala Phe Ser Thr Leu Pro
195 200 205

Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn Leu Leu Ser Ser Phe Pro
210 215 220

Ile Thr Gly Leu His Gly Leu Thr His Leu Lys Leu Thr Gly Asn His
225 230 235 240

Ala Leu Gln Ser Trp Ile Ser Ser Glu Asn Phe Pro Glu Leu Lys Val
245 250 255

Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys Ala Phe Gly Val Cys Glu
260 265 270

Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn Lys Gly Asp Asn Ser Ser
275 280 285

Met Asp Asp Leu His Lys Lys Asp Ala Gly Met Phe Gln Ala Gln Asp
290 295 300

Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp Phe Glu Glu Asp Leu Lys
305 310 315 320

Ala Leu His Ser Val Gln Cys Ser Pro Ser Pro Gly Pro Phe Lys Pro
325 330 335

Cys Glu His Leu Leu Asp Gly Trp Leu Ile Arg Ile Gly Val Trp Thr
340 345 350

Ile Ala Val Leu Ala Leu Thr Cys Asn Ala Leu Val Thr Ser Thr Val
355 360 365

Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile Lys Leu Leu Ile Gly Val
370 375 380

Ile Ala Ala Val Asn Met Leu Thr Gly Val Ser Ser Ala Val Leu Ala
385 390 395 400

Gly Val Asp Ala Phe Thr Phe Gly Ser Phe Ala Arg His Gly Ala Trp
405 410 415

Trp Glu Asn Gly Val Gly Cys His Val Ile Gly Phe Leu Ser Ile Phe
420 425 430

Ala Ser Glu Ser Ser Val Phe Leu Leu Thr Leu Ala Ala Leu Glu Arg
435 440 445

Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe Glu Thr Lys Ala Pro Phe
450 455 460

Ser Ser Leu Lys Val Ile Ile Leu Leu Cys Ala Leu Leu Ala Leu Thr
465 470 475 480

Met Ala Ala Val Pro Leu Leu Gly Gly Ser Lys Tyr Gly Ala Ser Pro
485 490 495

Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro Ser Thr Met Gly Tyr Met
500 505 510

Val Ala Leu Ile Leu Leu Asn Ser Leu Cys Phe Leu Met Met Thr Ile
515 520 525

Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp Lys Gly Asp Leu Glu Asn
530 535 540

Ile Trp Asp Cys Ser Met Val Lys His Ile Ala Leu Leu Leu Phe Thr
545 550 555 560

Asn Cys Ile Leu Asn Cys Pro Val Ala Phe Leu Ser Phe Ser Ser Leu
565 570 575

Ile Asn Leu Thr Phe Ile Ser Pro Glu Val Ile Lys Phe Ile Leu Leu
580 585 590

Val Val Val Pro Leu Pro Ala Cys Leu Asn Pro Leu Leu Tyr Ile Leu
595 600 605

Phe Asn Pro His Phe Lys Glu Asp Leu Val Ser Leu Arg Lys Gln Thr
610 615 620

Tyr Val Trp Thr Arg Ser Lys His Pro Ser Leu Met Ser Ile Asn Ser
625 630 635 640

Asp Asp Val Glu Lys Gln Ser Cys Asp Ser Thr Gln Ala Leu Val Thr
645 650 655

Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu Pro Pro Ser Ser Val Pro
660 665 670

Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys His Leu Ser Ser Val Ala
675 680 685

Phe Val Pro Cys Leu
690



<210> 5 
<211> 2467 
<212> DNA 
<213> human 



<400> 5 

gaaaggagga aagaaaaaaa gaggaatgga aagagacaga gaaaggaaat gggagtggaa 60 

ggagggagga ctgctttgta actgctaaga ttgcagacag aaatagcaca caaccactgt 120 

gagctgtatg cgattcagaa accaagacca aattttgctc actttcatta atcagttgct 180 

cagatagaag gaaatgacat ctggttctgt cttcttctac atcttaattt ttggaaaata 240 

tttttctcat gggggtggac aggatgtcaa gtgctccctt ggctatttcc cctgtgggaa 300 

catcacaaag tgcttgcctc agctcctgca ctgtaacggt gtggacgact gcgggaatca 360 

ggccgatgag gacaactgtg gagacaacaa tggatggtcc atgcaatttg acaaatattt 420 

tgccagttac tacaaaatga cttcccaata tccttttgag gcagaaacac ctgaatgttt 480 






ggtcggttct gtgccagtgc aatgtctttg ccaaggtctg gagcttgact gtgatgaaac 540 

caatttacga gctgttccat cggtttcttc aaatgtgact gcaatgtcac ttcagtggaa 600 

cttaataaga aagcttcctc ctgattgctt caagaattat catgatcttc agaagctgta 660 

cctgcaaaac aataagatta catccatctc catctatgct ttcagaggac tgaatagcct 720 

tactaaactg tatctcagtc ataacagaat aaccttcctg aagccgggtg tttttgaaga 780 

tcttcacaga ctagaatggc tgataattga agataatcac ctcagtcgaa tttccccacc 840 

aacattttat ggactaaatt ctcttattct cttagtcctg atgaataacg tcctcacccg 900 

tttacctgat aaacctctct gtcaacacat gccaagacta cattggctgg accttgaagg 960 

caaccatatc cataatttaa gaaatttgac ttttatttcc tgcagtaatt taactgtttt 1020 

agtgatgagg aaaaacaaaa ttaatcactt aaatgaaaat acttttgcac ctctccagaa 1080 

actggatgaa ttggatttag gaagtaataa gattgaaaat cttccaccgc ttatattcaa 1140 

ggacctgaag gagctgtcac aattgaatct ttcctataat ccaatccaga aaattcaagc 1200 

aaaccaattt gattatcttg tcaaactcaa gtctctcagc ctagaaggga ttgaaatttc 1260 

aaatatccaa caaaggatgt ttagacctct tatgaatctc tctcacatat attttaagaa 1320 

attccagtac tgtgggtatg caccacatgt tcgcagctgt aaaccaaaca ctgatggaat 1380 

ttcatctcta gagaatctct tggcaagcat tattcagaga gtatttgtct gggttgtatc 1440 

tgcagttacc tgctttggaa acatttttgt catttgcatg cgaccttata tcaggtctga 1500 

gaacaagctg tatgccatgt caatcatttc tctctgctgt gccgactgct taatgggaat 1560 

atatttattc gtgatcggag gctttgacct aaagtttcgt ggagaataca ataagcatgc 1620 

gcagctgtgg atggagagta ctcattgtca gcttgtagga tctttggcca ttctgtccac 1680 

agaagtatca gttttactgt taacatttct gacattggaa aaatacatct gcattgtcta 1740

tccttttaga tgtgtgagac ctggaaaatg cagaacaatt acagttctga ttctcatttg 1800 

gattactggt tttatagtgg ctttcattcc attgagcaat aaggaatttt tcaaaaacta 1860 

ctatggcacc aatggagtat gcttccctct tcattcagaa gatacagaaa gtattggagc 1920 

ccagatttat tcagtggcaa tttttcttgg tattaatttg gccgcattta tcatcatagt 1980 

tttttcctat ggaagcatgt tttatagtgt tcatcaaagt gccataacag caactgaaat 2040 

acggaatcaa gttaaaaaag agatgatcct tgccaaacgt tttttcttta tagtatttac 2100 

tgatgcatta tgctggatac ccatttttgt agtgaaattt ctttcactgc ttcaggtaga 2160 

aataccaggt accataacct cttgggtagt gatttttatt ctgcccatta acagtgcttt 2220 

gaacccaatt ctctatactc tgaccacaag accatttaaa gaaatgattc atcggttttg 2280 

gtataactac agacaaagaa aatctatgga cagcaaaggt cagaaaacat atgctccatc 2340 

attcatctgg gtggaaatgt ggccactgca ggagatgcca cctgagttaa tgaagccgga 2400 

ccttttcaca tacccctgtg aaatgtcact gatttctcaa tcaacgagac tcaattccta 2460

ttcatga 2467 

<210> 6 
<211> 757 
<212> PRT 
<213> human 



<400> 6

Met Thr Ser Gly Ser Val Phe Phe Tyr Ile Leu Ile Phe Gly Lys Tyr
1 5 10 15

Phe Ser His Gly Gly Gly Gln Asp Val Lys Cys Ser Leu Gly Tyr Phe
20 25 30

Pro Cys Gly Asn Ile Thr Lys Cys Leu Pro Gln Leu Leu His Cys Asn
35 40 45

Gly Val Asp Asp Cys Gly Asn Gln Ala Asp Glu Asp Asn Cys Gly Asp
50 55 60

Asn Asn Gly Trp Ser Met Gln Phe Asp Lys Tyr Phe Ala Ser Tyr Tyr
65 70 75 80

Lys Met Thr Ser Gln Tyr Pro Phe Glu Ala Glu Thr Pro Glu Cys Leu
85 90 95

Val Gly Ser Val Pro Val Gln Cys Leu Cys Gln Gly Leu Glu Leu Asp
100 105 110

Cys Asp Glu Thr Asn Leu Arg Ala Val Pro Ser Val Ser Ser Asn Val
115 120 125

Thr Ala Met Ser Leu Gln Trp Asn Leu Ile Arg Lys Leu Pro Pro Asp
130 135 140

Cys Phe Lys Asn Tyr His Asp Leu Gln Lys Leu Tyr Leu Gln Asn Asn
145 150 155 160

Lys Ile Thr Ser Ile Ser Ile Tyr Ala Phe Arg Gly Leu Asn Ser Leu
165 170 175

Thr Lys Leu Tyr Leu Ser His Asn Arg Ile Thr Phe Leu Lys Pro Gly
180 185 190

Val Phe Glu Asp Leu His Arg Leu Glu Trp Leu Ile Ile Glu Asp Asn
195 200 205

His Leu Ser Arg Ile Ser Pro Pro Thr Phe Tyr Gly Leu Asn Ser Leu
210 215 220

Ile Leu Leu Val Leu Met Asn Asn Val Leu Thr Arg Leu Pro Asp Lys
225 230 235 240

Pro Leu Cys Gln His Met Pro Arg Leu His Trp Leu Asp Leu Glu Gly
245 250 255

Asn His Ile His Asn Leu Arg Asn Leu Thr Phe Ile Ser Cys Ser Asn
260 265 270

Leu Thr Val Leu Val Met Arg Lys Asn Lys Ile Asn His Leu Asn Glu
275 280 285

Asn Thr Phe Ala Pro Leu Gln Lys Leu Asp Glu Leu Asp Leu Gly Ser
290 295 300

Asn Lys Ile Glu Asn Leu Pro Pro Leu Ile Phe Lys Asp Leu Lys Glu
305 310 315 320

Leu Ser Gln Leu Asn Leu Ser Tyr Asn Pro Ile Gln Lys Ile Gln Ala
325 330 335

Asn Gln Phe Asp Tyr Leu Val Lys Leu Lys Ser Leu Ser Leu Glu Gly
340 345 350

Ile Glu Ile Ser Asn Ile Gln Gln Arg Met Phe Arg Pro Leu Met Asn
355 360 365

Leu Ser His Ile Tyr Phe Lys Lys Phe Gln Tyr Cys Gly Tyr Ala Pro
370 375 380

His Val Arg Ser Cys Lys Pro Asn Thr Asp Gly Ile Ser Ser Leu Glu
385 390 395 400

Asn Leu Leu Ala Ser Ile Ile Gln Arg Val Phe Val Trp Val Val Ser
405 410 415

Ala Val Thr Cys Phe Gly Asn Ile Phe Val Ile Cys Met Arg Pro Tyr
420 425 430

Ile Arg Ser Glu Asn Lys Leu Tyr Ala Met Ser Ile Ile Ser Leu Cys
435 440 445

Cys Ala Asp Cys Leu Met Gly Ile Tyr Leu Phe Val Ile Gly Gly Phe
450 455 460

Asp Leu Lys Phe Arg Gly Glu Tyr Asn Lys His Ala Gln Leu Trp Met
465 470 475 480

Glu Ser Thr His Cys Gln Leu Val Gly Ser Leu Ala Ile Leu Ser Thr
485 490 495

Glu Val Ser Val Leu Leu Leu Thr Phe Leu Thr Leu Glu Lys Tyr Ile
500 505 510

Cys Ile Val Tyr Pro Phe Arg Cys Val Arg Pro Gly Lys Cys Arg Thr
515 520 525

Ile Thr Val Leu Ile Leu Ile Trp Ile Thr Gly Phe Ile Val Ala Phe
530 535 540

Ile Pro Leu Ser Asn Lys Glu Phe Phe Lys Asn Tyr Tyr Gly Thr Asn
545 550 555 560

Gly Val Cys Phe Pro Leu His Ser Glu Asp Thr Glu Ser Ile Gly Ala
565 570 575

Gln Ile Tyr Ser Val Ala Ile Phe Leu Gly Ile Asn Leu Ala Ala Phe
580 585 590

Ile Ile Ile Val Phe Ser Tyr Gly Ser Met Phe Tyr Ser Val His Gln
595 600 605

Ser Ala Ile Thr Ala Thr Glu Ile Arg Asn Gln Val Lys Lys Glu Met
610 615 620

Ile Leu Ala Lys Arg Phe Phe Phe Ile Val Phe Thr Asp Ala Leu Cys
625 630 635 640

Trp Ile Pro Ile Phe Val Val Lys Phe Leu Ser Leu Leu Gln Val Glu
645 650 655

Ile Pro Gly Thr Ile Thr Ser Trp Val Val Ile Phe Ile Leu Pro Ile
660 665 670

Asn Ser Ala Leu Asn Pro Ile Leu Tyr Thr Leu Thr Thr Arg Pro Phe
675 680 685

Lys Glu Met Ile His Arg Phe Trp Tyr Asn Tyr Arg Gln Arg Lys Ser
690 695 700

Met Asp Ser Lys Gly Gln Lys Thr Tyr Ala Pro Ser Phe Ile Trp Val
705 710 715 720

Glu Met Trp Pro Leu Gln Glu Met Pro Pro Glu Leu Met Lys Pro Asp
725 730 735

Leu Phe Thr Tyr Pro Cys Glu Met Ser Leu Ile Ser Gln Ser Thr Arg
740 745 750

Leu Asn Ser Tyr Ser
755



<210> 7 
<211> 3584 
<212> DNA 
<213> human 

<400> 7 

ctgctttgta actgctaaga ttgcagacag aaatagcaca caaccactgt gagctgtatg 60 

cgattcagaa accaagacca aattttgctc actttcatta atcagttgct cagatagaag 120 

gaaatgacat ctggttctgt cttcttctac atcttaattt ttggaaaata tttttctcat 180 

gggggtggac aggatgtcaa gtgctccctt ggctatttcc cctgtgggaa catcacaaag 240 

tgcttgcctc agctcctgca ctgtaacggt gtggacgact gcgggaatca ggccgatgag 300 

gacaactgtg tggtggtttt gtgccagtgc atgtctttgc caggtctgga gcttgactgg 360 

atgaaaccat ttacgagtgt tccatcggtt tcttcaaatg tgactgcaat gtcacttcag 420 

tggaacttaa taagaaagct tcctcctgat tgcttcaaga attatcatga tcttcagaag 480 

ctggacctgc aaaacaataa gattacatcc atctccatct atgctttcag aggactgaat 540 

agccttacta aactgtatct cagtcataac agaataacct tcctgaagcc gggtgttttt 600 

gaagatcttc acagactaga atggctgata attgaagata atcacctcag tcgaatttcc 660 

ccaccaacat tttatggact aaattctctt attctcttag tcctgatgaa taacgtcctc 720 

acccgtttac ctgataaacc tctctgtcaa cacatgccaa gactacattg gctggacctt 780 

gaaggcaacc atatccataa tttaagaaat ttgactttta tttcctgcag taatttaact 840 

gttttagtga tgaggaaaaa caaaattaat cacttaaatg aaaatacttt tgcacctctc 900 

cagaaactgg atgaattgga tttaggaagt aataagattg aaaatcttcc accgcttata 960 

ttcaaggacc tgaaggagct gtcacaattg aatctttcct ataatccaat ccagaaaatt 1020 

caagcaaacc aatttgatta tcttgtcaaa ctcaagtctc tcagcctaga agggattgaa 1080 

atttcaaata tccaacaaag gatgtttaga cctcttatga atctctctca catatatttt 1140 

aagaaattcc agtactgtgg gtatgcacca catgttcgca gctgtaaacc aaacactgat 1200 

ggaatttcat ctctagagaa tctcttggca agcattattc agagagtatt tgtctgggtt 1260 

gtatctgcag ttacctgctt tggaaacatt tttgtcattt gcatgcgacc ttatatcagg 1320 

tctgagaaca agctgtatgc catgtcaatc atttctctct gctgtgccga ctgcttaatg 1380 

ggaatatatt tattcgtgat cggaggcttt gacctaaagt ttcgtggaga atacaataag 1440 

catgcgcagc tgtggatgga gagtactcat tgtcagcttg taggatcttt ggccattctg 1500 

tccacagaag tatcagtttt actgttaaca tttctgacat tggaaaaata catctgcatt 1560 

gtctatcctt ttagatgtgt gagacctgga aaatgcagaa caattacagt tctgattctc 1620 

atttggatta ctggttttat agtggctttc attccattga gcaataagga atttttcaaa 1680 

aactactatg gcaccaatgg agtatgcttc cctcttcatt cagaagatac agaaagtatt 1740 

ggagcccaga tttattcagt ggcaattttt cttggtatta atttggccgc atttatcatc 1800 

atagtttttt cctatggaag catgttttat agtgttcatc aaagtgccat aacagcaact 1860 

gaaatacgga atcaagttaa aaaagagatg atccttgcca aacgtttttt ctttatagta 1920 
tttactgatg cattatgctg gatacccatt tttgtagtga aatttctttc actgcttcag 1980

gtagaaatac caggtaccat aacctcttgg gtagtgattt ttattctgcc cattaacagt 2040 

gctttgaacc caattctcta tactctgacc acaagaccat ttaaagaaat gattcatcgg 2100 

ttttggtata actacagaca aagaaaatct atggacagca aaggtcagaa aacatatgct 2160 

ccatcattca tctgggtgga aatgtggcca ctgcaggaga tgccacctga gttaatgaag 2220 

ccggaccttt tcacataccc ctgtgaaatg tcactgattt ctcaatcaac gagactcaat 2280 

tcctattcat gactgactct gaaattcatt tcttcgcaga gaatactgtg ggggtgcttc 2340 

atgagggatt tactggtatg aaaatgaata ccacaaaatt aatttataat aatagctaag 2400 

ataaatattt tacaaggaca tgaggaaaaa taaaaatgac taatgctctt acaaagggaa 2460

gtaattatat caataatgta tatatattag tagacatttt gcataagaaa ttaagagaaa 2520 

tctacttcag taacattcat tcatttttct aacatgcatt tattgagtac ccactactat 2580 

gtgcatagca ttgcaatata gtcctggaag tagacagtgc agaacctttc aatctgtaga 2640 

tagtgtttaa tgacaaaaga ctatacaaag tccatctgca gttcctagtt taaagtagag 2700 

ctttacctgt catgtgcatc agcaagaatc ataggcactt ttaaataaag gtttaaagtt 2760 

ttggaatact cagtgtattt gcatcataga aaatgtctga ctgtttgcaa aataatattc 2820 

tgttttaaga atccatctta cctctcttta agtttccata cacttgagag ccaacacaac 2880 

atatttatta ctaaaaagat gctttgctag aaactcaaaa acagcacttc ttttggcact 2940 

tcctgcccag ttttctcttt gctttaaatg aacatcatca tatggaattg gaataggaga 3000 

gtatgagtac ggcagagaag tggatcagaa aaactagaat gaggataaac atttacatta 3060 

gtggaaactc ctgaaataaa tccttgtatt gtcagttaac tgattttcaa caaggatgcc 3120 

aagacaaaaa ggcttttcaa caaaccgtgc tgttttaaga acagacctaa gtggtttaat 3180 

tcacccactt tagatgggtg aatgttatgg tgtgtgaaat atctcagtaa agcagttaaa 3240 

aggaaaaaga gctggaatgc actgattcag gaacttaatt tcaggaagga aaggtctgta 3300 

tgtacacatt tcactttaag cagaaaatct ttcttcaaga aatgacttta ctttctcttt 3360 

gcactgccag cacgtgagat actaactttt taactagttg ttcttctcta gtctctacgt 3420 

tattagnatt ttttgctttc ataatgtgaa acctttaagc aggagaagaa aatgttttca 3480 

gatagtttca aatacnccaa aaatgtttgc aacacaaaaa tactggaatc naaccataat 3540 

gcccttattg aatatatagt tgtatagntt tgttctgaaa accc 3584 

<210> 8 
<211> 722 
<212> PRT 
<213> human 

<400> 8

Met Thr Ser Gly Ser Val Phe Phe Tyr Ile Leu Ile Phe Gly Lys Tyr
1 5 10 15

Phe Ser His Gly Gly Gly Gln Asp Val Lys Cys Ser Leu Gly Tyr Phe
20 25 30

Pro Cys Gly Asn Ile Thr Lys Cys Leu Pro Gln Leu Leu His Cys Asn
35 40 45

Gly Val Asp Asp Cys Gly Asn Gln Ala Asp Glu Asp Asn Cys Val Val
50 55 60

Val Leu Cys Gln Cys Met Ser Leu Pro Gly Leu Glu Leu Asp Trp Met
65 70 75 80

Lys Pro Phe Thr Ser Val Pro Ser Val Ser Ser Asn Val Thr Ala Met
85 90 95

Ser Leu Gln Trp Asn Leu Ile Arg Lys Leu Pro Pro Asp Cys Phe Lys
100 105 110

Asn Tyr His Asp Leu Gln Lys Leu Asp Leu Gln Asn Asn Lys Ile Thr
115 120 125

Ser Ile Ser Ile Tyr Ala Phe Arg Gly Leu Asn Ser Leu Thr Lys Leu
130 135 140

Tyr Leu Ser His Asn Arg Ile Thr Phe Leu Lys Pro Gly Val Phe Glu
145 150 155 160

Asp Leu His Arg Leu Glu Trp Leu Ile Ile Glu Asp Asn His Leu Ser
165 170 175

Arg Ile Ser Pro Pro Thr Phe Tyr Gly Leu Asn Ser Leu Ile Leu Leu
180 185 190

Val Leu Met Asn Asn Val Leu Thr Arg Leu Pro Asp Lys Pro Leu Cys
195 200 205

Gln His Met Pro Arg Leu His Trp Leu Asp Leu Glu Gly Asn His Ile
210 215 220

His Asn Leu Arg Asn Leu Thr Phe Ile Ser Cys Ser Asn Leu Thr Val
225 230 235 240

Leu Val Met Arg Lys Asn Lys Ile Asn His Leu Asn Glu Asn Thr Phe
245 250 255

Ala Pro Leu Gln Lys Leu Asp Glu Leu Asp Leu Gly Ser Asn Lys Ile
260 265 270

Glu Asn Leu Pro Pro Leu Ile Phe Lys Asp Leu Lys Glu Leu Ser Gln
275 280 285

Leu Asn Leu Ser Tyr Asn Pro Ile Gln Lys Ile Gln Ala Asn Gln Phe
290 295 300

Asp Tyr Leu Val Lys Leu Lys Ser Leu Ser Leu Glu Gly Ile Glu Ile
305 310 315 320

Ser Asn Ile Gln Gln Arg Met Phe Arg Pro Leu Met Asn Leu Ser His
325 330 335

Ile Tyr Phe Lys Lys Phe Gln Tyr Cys Gly Tyr Ala Pro His Val Arg
340 345 350

Ser Cys Lys Pro Asn Thr Asp Gly Ile Ser Ser Leu Glu Asn Leu Leu
355 360 365

Ala Ser Ile Ile Gln Arg Val Phe Val Trp Val Val Ser Ala Val Thr
370 375 380

Cys Phe Gly Asn Ile Phe Val Ile Cys Met Arg Pro Tyr Ile Arg Ser
385 390 395 400

Glu Asn Lys Leu Tyr Ala Met Ser Ile Ile Ser Leu Cys Cys Ala Asp
405 410 415

Cys Leu Met Gly Ile Tyr Leu Phe Val Ile Gly Gly Phe Asp Leu Lys
420 425 430

Phe Arg Gly Glu Tyr Asn Lys His Ala Gln Leu Trp Met Glu Ser Thr
435 440 445

His Cys Gln Leu Val Gly Ser Leu Ala Ile Leu Ser Thr Glu Val Ser
450 455 460

Val Leu Leu Leu Thr Phe Leu Thr Leu Glu Lys Tyr Ile Cys Ile Val
465 470 475 480

Tyr Pro Phe Arg Cys Val Arg Pro Gly Lys Cys Arg Thr Ile Thr Val
485 490 495

Leu Ile Leu Ile Trp Ile Thr Gly Phe Ile Val Ala Phe Ile Pro Leu
500 505 510






Ser 


Asn 


Lys 


Glu 


Phe 


Phe 


Lys 


Asn 


Tyr 


Tyr 


Gly 


Thr 


Asn 


Gly 


Val 


Cys 






515 










520 










525 








Phe 


Pro 


Leu 


His 


Ser 


Glu 


Asp 


Thr 


Glu 


Ser 


lie 


Gly 


Ala 


Gin 


He 


Tyr 




530 










535 










540 










Ser 


Val 


Ala 


He 


Phe 


Leu 


Gly 


He 


Asn 


Leu 


Ala 


Ala 


Phe 


He 


He 


He 


545 










550 










555 










560 


Val 


Phe 


Ser 


Tyr 


Gly 


Ser 


Met 


Phe 


Tyr 


Ser 


Val 


His 


Gin 


Ser 


Ala 


He 










565 










570 










575 




Thr 


Ala 


Thr 


Glu 


He 


Arg 


Asn 


Gin 


Val 


Lys 


Lys 


Glu 


Met 


He 


Leu 


Ala 








580 










585 










590 






Lys 


Arg 


Phe 


Phe 


Phe 


He 


Val 


Phe 


Thr 


Asp 


Ala 


Leu 


Cys 


Trp 


He 


Pro 






595 










600 










605 








He 


Phe 


Val 


Val 


Lys 


Phe 


Leu 


Ser 


Leu 


Leu 


Gin 


Val 


Glu 


He 


Pro 


Gly 




610 










615 










620 










Thr 


He 


Thr 


Ser 


Trp 


Val 


Val 


He 


Phe 


He 


Leu 


Pro 


He 


Asn 


Ser 


Ala 


625 










630 










635 










640 


Leu 


Asn 


Pro 


He 


Leu 


Tyr 


Thr 


Leu 


Thr 


Thr 


Arg 


Pro 


Phe 


Lys 


Glu 


Met 










645 










650 










655 
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He His Arg Phe Trp Tyr Asn Tyr Arg Gin Arg Lys Ser Met Asp Ser 

660 665 670 

Lys Gly Gin Lys Thr Tyr Ala Pro Ser Phe He Trp Val Glu Met Trp 

675 680 685 

Pro Leu Gin Glu Met Pro Pro Glu Leu Met Lys Pro Asp Leu Phe Thr 

690 695 700 

Tyr Pro Cys Glu Met Ser Leu He Ser Gin Ser Thr Arg Leu Asn Ser 
705 710 715 720 

Tyr Ser 






INTERNATIONAL SEARCH REPORT 



International application No.
PCT/US99/06573



A. CLASSIFICATION OF SUBJECT MATTER

IPC(6): C07K 14/705; C12N 15/12, 15/63, 15/70, 15/79

US CL: 530/350; 435/69.1, 252.3, 254.11, 320.1, 325
According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)

U.S.: 530/350; 435/69.1, 252.3, 254.11, 320.1, 325



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
APS, Biosis, Medline, WPI

search terms: G-protein coupled receptor, Leucine rich repeats, Gonadotropin receptor, Thyrotropin receptor



C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*   Citation of document, with indication, where appropriate,     Relevant to claim No.
            of the relevant passages

A           US 5,614,363 A (CONE) 25 March 1997, entire document.         1-11

X,P         US 5,858,716 A (ELSHOURBAGY et al.) 12 January 1999,          11
Y,P         columns 20-30, entire document.                               1-10

X           HWANG et al. Analysis of Expressed Sequence Tags from a       5, 7, 11
Y           Fetal Human Heart cDNA Library. Genomics. 1995, Vol. 30,      1-4, 6, 8-10
            pages 293-298, entire document.



[X] Further documents are listed in the continuation of Box C.    [ ] See patent family annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered
to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is
cited to establish the publication date of another citation or other
special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other
means

"P" document published prior to the international filing date but later than
the priority date claimed

"T" later document published after the international filing date or priority
date and not in conflict with the application but cited to understand
the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be
considered novel or cannot be considered to involve an inventive step
when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be
considered to involve an inventive step when the document is
combined with one or more other such documents, such combination
being obvious to a person skilled in the art



Date of the actual completion of the international search
11 JUNE 1999


Date of mailing of the international search report

02 AUG 1999


Name and mailing address of the ISA/US
Commissioner of Patents and Trademarks

Box PCT

Washington, D.C. 20231
Facsimile No. (703) 305-3230


Authorized officer
Telephone No. (703) 308-0196



Form PCT/ISA/210 (second sheet)(July 1992)*



INTERNATIONAL SEARCH REPORT 



International application No.
PCT/US99/06573 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

Group I, claims 1-11, drawn to nucleic acids encoding LGR4, the LGR4 polypeptide, and method of using the LGR4
nucleic acid.

Group II, claims 1-11, drawn to nucleic acids encoding LGR5, the LGR5 polypeptide, and method of using the LGR5
nucleic acid.

Group III, claims 1-11, drawn to nucleic acid encoding LGR6, the LGR6 polypeptide, and method of using the LGR6
nucleic acid.

Group IV, claims 12 and 13, drawn to antibody that binds to LGR4.
Group V, claims 12 and 13, drawn to antibody that binds to LGR5.
Group VI, claims 12 and 13, drawn to antibody that binds to LGR7.

Group VII, claims 14-17, drawn to transgenic animal model containing an altered LGR4 gene.
Group VIII, claims 14-17, drawn to transgenic animal model containing an altered LGR5 gene.
Group IX, claims 14-17, drawn to transgenic animal model containing an altered LGR7 gene.
Each of the claims 1-18 is in three different groups because LGR4, LGR5, and LGR7 are structurally and functionally
distinct polypeptides.

The inventions listed as Groups I-XII do not relate to a single inventive concept under PCT Rule 13.1 because, under
amino acid sequence of LGR4. The special technical feature of Group V is the antibody that binds to LGR5 but does
not have the amino acid sequence of LGR5. The special technical feature of Group VI is the antibody that binds to
LGR6 but does not have the amino acid sequence of LGR6. The special technical feature of Group VII is a transgenic
animal containing an altered LGR4 gene. The special technical feature of Group VIII is a transgenic animal containing
an altered LGR5 gene. The special technical feature of Group IX is a transgenic animal containing an altered LGR7
gene. The special technical feature of Group X is a method of screening for a ligand that binds LGR4.